hamishivi committed on
Commit
2a3b0b8
1 Parent(s): cbe7317

Update README.md

  1. README.md +1 -1
README.md CHANGED
@@ -41,7 +41,7 @@ All smaller DPO'd models have strong performance per model size in the category
  | Model | Size | Alignment | MT-Bench (score) | AlpacaEval (win rate %) |
  |-------------|-----|----|---------------|--------------|
  | **Tulu-v2-7b** 🐪 | **7B** | **dDPO** | **6.30** | **73.9** |
- | **Tulu-v2-dpo-7b** 🐪 | **7B** | **dDPO** | **6.27** | **85.1** |
+ | **Tulu-v2-dpo-7b** 🐪 | **7B** | **dDPO** | **6.29** | **85.1** |
  | StableLM-Tuned-α | 7B| dSFT |2.75| -|
  | MPT-Chat | 7B |dSFT |5.42| -|
  | Xwin-LMv0.1 | 7B| dPPO| 6.19| 87.83|