Tianhua commited on
Commit
b279ee7
1 Parent(s): f692d0d

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +21 -0
README.md CHANGED
@@ -42,6 +42,27 @@ The summary of the instruction tuning data is as follows:
42
 
43
  <center><img src="data_table.jpg" alt="Instruction Data"/></center>
44
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
45
  # Reproducing the Results
46
 
47
  We will realize the training code and the training data soon. Our training code is based on [Megatron-LM](https://github.com/NVIDIA/Megatron-LM), with some modifications to support our training data format and Maximal Update Parametrization (μP).
 
42
 
43
  <center><img src="data_table.jpg" alt="Instruction Data"/></center>
44
 
45
+ # Instruction Format
46
+
47
+ We've added some new special tokens to the CrystalCoder tokenizer to support the instruction tuning.
48
+
49
+ List special tokens used in the instruction tuning:
50
+
51
+ ```
52
+ bos: <s>
53
+ eos: </s>
54
+ system_start: <|sys_start|>
55
+ system_end: <|sys_end|>
56
+ user_start: <|im_start|>
57
+ user_end: <|im_end|>
58
+ ```
59
+
60
+ The instruction format is as follows:
61
+
62
+ ```
63
+ <s> <|sys_start|> system prompt <|sys_end|> <|im_start|> first user utterance <|im_end|> first model response <|im_start|> next user utterance <|im_end|> next model response </s>
64
+ ```
65
+
66
  # Reproducing the Results
67
 
68
  We will realize the training code and the training data soon. Our training code is based on [Megatron-LM](https://github.com/NVIDIA/Megatron-LM), with some modifications to support our training data format and Maximal Update Parametrization (μP).