hamishivi committed on
Commit
4a372aa
1 Parent(s): ec4d2dd

Update README.md

Files changed (1)
  1. README.md +1 -1
README.md CHANGED
@@ -47,7 +47,7 @@ It even beats Tulu 2+DPO 70B in some cases, although it loses out in harder reas
  For details on training and evaluation, read [our paper](https://link.todo)!
 
 
- | Model | Size | Alignment | AlpacaEval 2 Winrate (LC) | GSM8k 8-shot CoT Acc. | Average Perf. across Open-Instruct evals |
+ | Model | Size | Alignment | GSM8k 8-shot CoT Acc. | AlpacaEval 2 Winrate (LC) | Average Perf. across Open-Instruct evals |
  |-|-|-|-|-|-|
  | **Tulu V2.5 PPO 13B (this model)** | 13B | PPO with 70B RM | 58.0 | **26.7** | 62.8 |
  | **Tulu V2 DPO 13B** | 13B | DPO | 50.5 | 16.0 | 61.0 |