allenai/tulu-2-dpo-70b

hamishivi committed on Nov 18, 2023

Commit 2a3b0b8 • 1 Parent(s): cbe7317

Update README.md

Files changed (1)

README.md +1 -1

@@ -41,7 +41,7 @@ All smaller DPO'd models have strong performance per model size in the category
 | Model | Size | Alignment | MT-Bench (score) | AlpacaEval (win rate %) |
 |-------------|-----|----|---------------|--------------|
 | **Tulu-v2-7b** 🐪 | **7B** | **dDPO** | **6.30** | **73.9** |
-| **Tulu-v2-dpo-7b** 🐪 | **7B** | **dDPO** | **6.27** | **85.1** |
+| **Tulu-v2-dpo-7b** 🐪 | **7B** | **dDPO** | **6.29** | **85.1** |
 | StableLM-Tuned-α | 7B| dSFT |2.75| -|
 | MPT-Chat |  7B |dSFT |5.42| -|
 | Xwin-LMv0.1 | 7B| dPPO| 6.19| 87.83|