Update README.md
Browse files
README.md
CHANGED
@@ -42,6 +42,27 @@ The summary of the instruction tuning data is as follows:
|
|
42 |
|
43 |
<center><img src="data_table.jpg" alt="Instruction Data"/></center>
|
44 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
45 |
# Reproducing the Results
|
46 |
|
47 |
We will realize the training code and the training data soon. Our training code is based on [Megatron-LM](https://github.com/NVIDIA/Megatron-LM), with some modifications to support our training data format and Maximal Update Parametrization (μP).
|
|
|
42 |
|
43 |
<center><img src="data_table.jpg" alt="Instruction Data"/></center>
|
44 |
|
45 |
+
# Instruction Format
|
46 |
+
|
47 |
+
We've added some new special tokens to the CrystalCoder tokenizer to support the instruction tuning.
|
48 |
+
|
49 |
+
List special tokens used in the instruction tuning:
|
50 |
+
|
51 |
+
```
|
52 |
+
bos: <s>
|
53 |
+
eos: </s>
|
54 |
+
system_start: <|sys_start|>
|
55 |
+
system_end: <|sys_end|>
|
56 |
+
user_start: <|im_start|>
|
57 |
+
user_end: <|im_end|>
|
58 |
+
```
|
59 |
+
|
60 |
+
The instruction format is as follows:
|
61 |
+
|
62 |
+
```
|
63 |
+
<s> <|sys_start|> system prompt <|sys_end|> <|im_start|> first user utterance <|im_end|> first model response <|im_start|> next user utterance <|im_end|> next model response </s>
|
64 |
+
```
|
65 |
+
|
66 |
# Reproducing the Results
|
67 |
|
68 |
We will realize the training code and the training data soon. Our training code is based on [Megatron-LM](https://github.com/NVIDIA/Megatron-LM), with some modifications to support our training data format and Maximal Update Parametrization (μP).
|