Text Generation
Transformers
PyTorch
Safetensors
English
llama
conversational
Inference Endpoints
text-generation-inference
hamishivi commited on
Commit
aa3e5cf
β€’
1 Parent(s): f275d38

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +6 -21
README.md CHANGED
@@ -41,27 +41,12 @@ For more details, read the paper: [Camels in a Changing Climate: Enhancing LM Ad
41
 
42
  | Model | Size | Alignment | MT-Bench (score) | AlpacaEval (win rate %) |
43
  |-------------|-----|----|---------------|--------------|
44
- | **Tulu-v2-7b** πŸͺ | **7B** | **dDPO** | **6.30** | **73.9** |
45
- | **Tulu-v2-dpo-7b** πŸͺ | **7B** | **dDPO** | **6.29** | **85.1** |
46
- | StableLM-Tuned-Ξ± | 7B| dSFT |2.75| -|
47
- | MPT-Chat | 7B |dSFT |5.42| -|
48
- | Xwin-LMv0.1 | 7B| dPPO| 6.19| 87.83|
49
- | Mistral-Instructv0.1 | 7B| - | 6.84 |-|
50
- | Zephyr-7b-Ξ± |7B| dDPO| 6.88| -|
51
- | Zephyr-7b-Ξ² πŸͺ | 7B | dDPO | 7.34 | 90.60 |
52
- | **Tulu-v2-13b** πŸͺ | **13B** | **dDPO** | **6.70** | **78.9** |
53
- | **Tulu-v2-dpo-13b** πŸͺ | **13B** | **dDPO** | **7.00** | **89.5** |
54
- | Falcon-Instruct | 40B |dSFT |5.17 |45.71|
55
- | Guanaco | 65B | SFT |6.41| 71.80|
56
- | Llama2-Chat | 70B |RLHF |6.86| 92.66|
57
- | Vicuna v1.3 | 33B |dSFT |7.12 |88.99|
58
- | WizardLM v1.0 | 70B |dSFT |7.71 |-|
59
- | Xwin-LM v0.1 | 70B |dPPO |- |95.57|
60
- | **Tulu-v2-70b** πŸͺ | **70B** | **dDPO** | **7.49** | **86.6** |
61
- | **Tulu-v2-dpo-70b** πŸͺ | **70B** | **dDPO** | **7.89** | **95.1** |
62
- | GPT-3.5-turbo | - |RLHF |7.94 |89.37|
63
- | Claude 2 | - |RLHF |8.06| 91.36|
64
- | GPT-4 | -| RLHF |8.99| 95.28|
65
 
66
  ## Input Format
67
 
 
41
 
42
  | Model | Size | Alignment | MT-Bench (score) | AlpacaEval (win rate %) |
43
  |-------------|-----|----|---------------|--------------|
44
+ | **Tulu-v2-7b** πŸͺ | **7B** | **SFT** | **6.30** | **73.9** |
45
+ | **Tulu-v2-dpo-7b** πŸͺ | **7B** | **DPO** | **6.29** | **85.1** |
46
+ | **Tulu-v2-13b** πŸͺ | **13B** | **SFT** | **6.70** | **78.9** |
47
+ | **Tulu-v2-dpo-13b** πŸͺ | **13B** | **DPO** | **7.00** | **89.5** |
48
+ | **Tulu-v2-70b** πŸͺ | **70B** | **SFT** | **7.49** | **86.6** |
49
+ | **Tulu-v2-dpo-70b** πŸͺ | **70B** | **DPO** | **7.89** | **95.1** |
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
50
 
51
  ## Input Format
52