Commit 63bc8ee · 1 Parent(s): 8bf7bff
Update README.md
README.md CHANGED
@@ -37,6 +37,28 @@ Zephyr is a series of language models that are trained to act as helpful assista
37   - **Repository:** https://github.com/huggingface/alignment-handbook
38   - **Demo:** https://huggingface.co/spaces/HuggingFaceH4/zephyr-chat
39
40 + ## Performance
41 +
42 + ![image/png](https://cdn-uploads.huggingface.co/production/uploads/6200d0a443eb0913fa2df7cc/raxvt5ma16d7T23my34WC.png)
43 +
44 + | Model | Size | Align | MT-Bench (score) | AlpacaEval (win %) |
45 + |-------|------|-------|------------------|--------------------|
46 + | StableLM-Tuned-α | 7B | dSFT | 2.75 | - |
47 + | MPT-Chat | 7B | dSFT | 5.42 | - |
48 + | Xwin-LM v0.1 | 7B | dPPO | 6.19 | 87.83 |
49 + | Mistral-Instruct v0.1 | 7B | - | 6.84 | - |
50 + | Zephyr-7b-α | 7B | dDPO | 6.88 | - |
51 + | **Zephyr-7b-β** | 7B | dDPO | 7.34 | 90.60 |
52 + | Falcon-Instruct | 40B | dSFT | 5.17 | 45.71 |
53 + | Guanaco | 65B | SFT | 6.41 | 71.80 |
54 + | Llama2-Chat | 70B | RLHF | 6.86 | 92.66 |
55 + | Vicuna v1.3 | 33B | dSFT | 7.12 | 88.99 |
56 + | WizardLM v1.0 | 70B | dSFT | 7.71 | - |
57 + | Xwin-LM v0.1 | 70B | dPPO | - | 95.57 |
58 + | GPT-3.5-turbo | - | RLHF | 7.94 | 89.37 |
59 + | Claude 2 | - | RLHF | 8.06 | 91.36 |
60 + | GPT-4 | - | RLHF | 8.99 | 95.28 |
61 +
62   ## Intended uses & limitations
63
64   The model was initially fine-tuned on a filtered and preprocessed version of the [`UltraChat`](https://huggingface.co/datasets/stingning/ultrachat) dataset, which contains a diverse range of synthetic dialogues generated by ChatGPT.