Update README.md
Browse files
README.md
CHANGED
@@ -28,4 +28,24 @@ We trained using the [alignment handbook recipe](https://github.com/huggingface/
|
|
28 |
|
29 |
Visit the [W&B workspace here](https://wandb.ai/llm_surgery/mistral_zephyr_orpo_v0.2?nw=nwusercapecape)
|
30 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
31 |
## Trained on a single H100 for 2 hours!
|
|
|
28 |
|
29 |
Visit the [W&B workspace here](https://wandb.ai/llm_surgery/mistral_zephyr_orpo_v0.2?nw=nwusercapecape)
|
30 |
|
31 |
+
## Results:
|
32 |
+
|
33 |
+
- MT bench
|
34 |
+
```
|
35 |
+
########## First turn ##########
|
36 |
+
score
|
37 |
+
model turn
|
38 |
+
zephyr-orpo-7b-v0.2 1 7.44375
|
39 |
+
|
40 |
+
########## Second turn ##########
|
41 |
+
score
|
42 |
+
model turn
|
43 |
+
zephyr-orpo-7b-v0.2 2 6.875
|
44 |
+
|
45 |
+
########## Average ##########
|
46 |
+
score
|
47 |
+
model
|
48 |
+
zephyr-orpo-7b-v0.2 7.159375
|
49 |
+
```
|
50 |
+
|
51 |
## Trained on a single H100 for 2 hours!
|