File size: 866 Bytes
0eeb6f4
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
Wandb Runs: https://wandb.ai/eleutherai/pythia-rlhf/runs/644tyaq0?workspace=user-yongzx

Model Evals:
|    Task     |Version|Filter| Metric |Value |   |Stderr|
|-------------|-------|------|--------|-----:|---|-----:|
|arc_challenge|Yaml   |none  |acc     |0.2287|±  |0.0123|
|             |       |none  |acc_norm|0.2619|±  |0.0128|
|arc_easy     |Yaml   |none  |acc     |0.5248|±  |0.0102|
|             |       |none  |acc_norm|0.4533|±  |0.0102|
|logiqa       |Yaml   |none  |acc     |0.2089|±  |0.0159|
|             |       |none  |acc_norm|0.2765|±  |0.0175|
|piqa         |Yaml   |none  |acc     |0.6855|±  |0.0108|
|             |       |none  |acc_norm|0.6823|±  |0.0109|
|sciq         |Yaml   |none  |acc     |0.8050|±  |0.0125|
|             |       |none  |acc_norm|0.7080|±  |0.0144|
|winogrande   |Yaml   |none  |acc     |0.5335|±  |0.0140|