Update README.md
Browse files
README.md
CHANGED
@@ -11,6 +11,9 @@ pipeline_tag: text-generation
|
|
11 |
|
12 |
Mistral-7B-v0.1 model fine-tuned on the Ultrafeedback dataset using techinques shown in the paper [Self-Rewarding Language Models](https://arxiv.org/abs/2401.10020).
|
13 |
|
|
|
|
|
|
|
14 |
## Instruction format
|
15 |
|
16 |
In order to leverage instruction fine-tuning, your prompt should be surrounded by `[INST]` and `[/INST]` tokens. The very first instruction should begin with a begin of sentence id. The next instructions should not. The assistant generation will be ended by the end-of-sentence token id.
|
|
|
11 |
|
12 |
Mistral-7B-v0.1 model fine-tuned on the Ultrafeedback dataset using techinques shown in the paper [Self-Rewarding Language Models](https://arxiv.org/abs/2401.10020).
|
13 |
|
14 |
+
|
15 |
+
![image/png](https://cdn-uploads.huggingface.co/production/uploads/60394599033b61166496163b/NXhaRHGVGkQV6Qtrgu83F.png)
|
16 |
+
|
17 |
## Instruction format
|
18 |
|
19 |
In order to leverage instruction fine-tuning, your prompt should be surrounded by `[INST]` and `[/INST]` tokens. The very first instruction should begin with a begin of sentence id. The next instructions should not. The assistant generation will be ended by the end-of-sentence token id.
|