natolambert committed
Commit c7eaf1c • 1 Parent(s): bc6c48f
Update README.md
README.md
CHANGED
@@ -43,12 +43,12 @@ At the time of release, the Tulu-v2-dpo-70b model is approximately equal to GPT4
 All smaller DPO'd models have strong performance per model size in the category and with lower verbosity (average completion length).
 | Model | Size | Alignment | MT-Bench (score) | AlpacaEval (win rate %) |
 |-------------|-----|----|---------------|--------------|
-| **Tulu-v2-7b**
-| **Tulu-v2-13b**
-| **Tulu-v2-70b**
-| **Tulu-v2-dpo-7b**
-| **Tulu-v2-dpo-13b**
-| **Tulu-v2-dpo-70b**
+| **Tulu-v2-7b** 🐪 | **7B** | **dDPO** | **TODO** | **TODO** |
+| **Tulu-v2-13b** 🐪 | **13B** | **dDPO** | **TODO** | **TODO** |
+| **Tulu-v2-70b** 🐪 | **70B** | **dDPO** | **TODO** | **TODO** |
+| **Tulu-v2-dpo-7b** 🐪 | **7B** | **dDPO** | **TODO** | **TODO** |
+| **Tulu-v2-dpo-13b** 🐪 | **13B** | **dDPO** | **TODO** | **TODO** |
+| **Tulu-v2-dpo-70b** 🐪 | **70B** | **dDPO** | **TODO** | **TODO** |
 | StableLM-Tuned-α | 7B| dSFT |2.75| -|
 | MPT-Chat | 7B |dSFT |5.42| -|
 | Xwin-LMv0.1 | 7B| dPPO| 6.19| 87.83|