Update README.md
Browse files
README.md
CHANGED
@@ -10,6 +10,40 @@ model_type: mistral
|
|
10 |
|
11 |
# Swallow-MS-7b-v0.1
|
12 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
13 |
Our Swallow-MS-7b-v0.1 model has undergone continual pre-training from the Mistral-7B-v0.1, primarily with the addition of Japanese language data.
|
14 |
|
15 |
# Model Release Updates
|
|
|
10 |
|
11 |
# Swallow-MS-7b-v0.1
|
12 |
|
13 |
+
このモデルは[tokyotech-llm/Swallow-MS-7b-instruct-v0.1](https://huggingface.co/tokyotech-llm/Swallow-MS-7b-instruct-v0.1/commits/main)のtokenizer.chat_templateを以下に変更したものです。
|
14 |
+
```
|
15 |
+
tokenizer.chat_template = """{% if messages[0]['role'] == 'system' %}
|
16 |
+
{% set loop_messages = messages[1:] %}
|
17 |
+
{% set system_message = messages[0]['content'] %}
|
18 |
+
{% elif false == true and not '<<SYS>>' in messages[0]['content'] %}
|
19 |
+
{% set loop_messages = messages %}
|
20 |
+
{% set system_message = 'あなたは誠実で優秀な日本人のアシスタントです。' %}
|
21 |
+
{% else %}
|
22 |
+
{% set loop_messages = messages %}
|
23 |
+
{% set system_message = false %}
|
24 |
+
{% endif %}
|
25 |
+
{{ bos_token }}
|
26 |
+
{% for message in loop_messages %}
|
27 |
+
{% if (message['role'] == 'user') != ((loop.index0 + messages[0]['role'] == 'assistant') % 2 == 0) %}
|
28 |
+
{{ raise_exception('Conversation roles must alternate starting from the first role.') }}
|
29 |
+
{% endif %}
|
30 |
+
{% if loop.index0 == 0 and system_message != false %}
|
31 |
+
{% set content = '<<SYS>>\n' + system_message + '\n<</SYS>>\n\n' + message['content'] %}
|
32 |
+
{% else %}
|
33 |
+
{% set content = message['content'] %}
|
34 |
+
{% endif %}
|
35 |
+
{% if message['role'] == 'user' %}
|
36 |
+
{{ '[INST] ' + content.strip() + ' [/INST] ' }}
|
37 |
+
{% elif message['role'] == 'system' %}
|
38 |
+
{{ '<<SYS>>\n' + content.strip() + '\n<</SYS>>\n\n' }}
|
39 |
+
{% elif message['role'] == 'assistant' %}
|
40 |
+
{{ '' + content.strip() + '' + eos_token }}
|
41 |
+
{% endif %}
|
42 |
+
{% endfor %}
|
43 |
+
"""
|
44 |
+
```
|
45 |
+
元のモデルのrevisionは`8b17f1c87697fb354952fa0d1018568e50bdff56`です。
|
46 |
+
|
47 |
Our Swallow-MS-7b-v0.1 model has undergone continual pre-training from the Mistral-7B-v0.1, primarily with the addition of Japanese language data.
|
48 |
|
49 |
# Model Release Updates
|