Update README.md
README.md
CHANGED
@@ -41,7 +41,7 @@ All smaller DPO'd models have strong performance per model size in the category
 | Model | Size | Alignment | MT-Bench (score) | AlpacaEval (win rate %) |
 |-------------|-----|----|---------------|--------------|
 | **Tulu-v2-7b** 🐪 | **7B** | **dDPO** | **6.30** | **73.9** |
-| **Tulu-v2-dpo-7b** 🐪 | **7B** | **dDPO** | **6.
+| **Tulu-v2-dpo-7b** 🐪 | **7B** | **dDPO** | **6.29** | **85.1** |
 | StableLM-Tuned-α | 7B| dSFT |2.75| -|
 | MPT-Chat | 7B |dSFT |5.42| -|
 | Xwin-LMv0.1 | 7B| dPPO| 6.19| 87.83|