HachiML commited on
Commit
a4f1007
1 Parent(s): 9836a48

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +34 -0
README.md CHANGED
@@ -10,6 +10,40 @@ model_type: mistral
10
 
11
  # Swallow-MS-7b-v0.1
12
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
13
  Our Swallow-MS-7b-v0.1 model has undergone continual pre-training from the Mistral-7B-v0.1, primarily with the addition of Japanese language data.
14
 
15
  # Model Release Updates
 
10
 
11
  # Swallow-MS-7b-v0.1
12
 
13
+ このモデルは[tokyotech-llm/Swallow-MS-7b-instruct-v0.1](https://huggingface.co/tokyotech-llm/Swallow-MS-7b-instruct-v0.1/commits/main)のtokenizer.chat_templateを以下に変更したものです。
14
+ ```
15
+ tokenizer.chat_template = """{% if messages[0]['role'] == 'system' %}
16
+ {% set loop_messages = messages[1:] %}
17
+ {% set system_message = messages[0]['content'] %}
18
+ {% elif false == true and not '<<SYS>>' in messages[0]['content'] %}
19
+ {% set loop_messages = messages %}
20
+ {% set system_message = 'あなたは誠実で優秀な日本人のアシスタントです。' %}
21
+ {% else %}
22
+ {% set loop_messages = messages %}
23
+ {% set system_message = false %}
24
+ {% endif %}
25
+ {{ bos_token }}
26
+ {% for message in loop_messages %}
27
+ {% if (message['role'] == 'user') != ((loop.index0 + messages[0]['role'] == 'assistant') % 2 == 0) %}
28
+ {{ raise_exception('Conversation roles must alternate starting from the first role.') }}
29
+ {% endif %}
30
+ {% if loop.index0 == 0 and system_message != false %}
31
+ {% set content = '<<SYS>>\n' + system_message + '\n<</SYS>>\n\n' + message['content'] %}
32
+ {% else %}
33
+ {% set content = message['content'] %}
34
+ {% endif %}
35
+ {% if message['role'] == 'user' %}
36
+ {{ '[INST] ' + content.strip() + ' [/INST] ' }}
37
+ {% elif message['role'] == 'system' %}
38
+ {{ '<<SYS>>\n' + content.strip() + '\n<</SYS>>\n\n' }}
39
+ {% elif message['role'] == 'assistant' %}
40
+ {{ '' + content.strip() + '' + eos_token }}
41
+ {% endif %}
42
+ {% endfor %}
43
+ """
44
+ ```
45
+ 元のモデルのrevisionは`8b17f1c87697fb354952fa0d1018568e50bdff56`です。
46
+
47
  Our Swallow-MS-7b-v0.1 model has undergone continual pre-training from the Mistral-7B-v0.1, primarily with the addition of Japanese language data.
48
 
49
  # Model Release Updates