File size: 592 Bytes
1f88a5f
 
 
 
5f5a78b
 
1f88a5f
 
5f5a78b
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
---
license: cc-by-nc-4.0
datasets:
- HuggingFaceH4/ultrafeedback_binarized
language:
- en
---

Trained for one epoch on ultrafeedback_binarized using cDPO. Evaluation pending.

Some initial benchmark results:
|  Task   |Version| Metric |Value |   |Stderr|
|---------|------:|--------|-----:|---|-----:|
|hellaswag|      0|acc     |0.6621|±  |0.0047|
|         |       |acc_norm|0.8525|±  |0.0035|
|arc_challenge|      0|acc     |0.6348|±  |0.0141|
|             |       |acc_norm|0.6698|±  |0.0137|
|winogrande|      0|acc   |0.7861|±  |0.0115|
|gsm8k|      0|acc   |0.5694|±  |0.0136|