File size: 1,038 Bytes
7908b97
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
c67fb50
7908b97
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
Wandb Runs: https://wandb.ai/eleutherai/pythia-rlhf/runs/644tyaq0?workspace=user-yongzx

Model Evals:
|    Task     |Version|Filter| Metric |Value |   |Stderr|
|-------------|-------|------|--------|-----:|---|-----:|
|arc_challenge|Yaml   |none  |acc     |0.2287|±  |0.0123|
|             |       |none  |acc_norm|0.2619|±  |0.0128|
|arc_easy     |Yaml   |none  |acc     |0.5248|±  |0.0102|
|             |       |none  |acc_norm|0.4533|±  |0.0102|
|logiqa       |Yaml   |none  |acc     |0.2089|±  |0.0159|
|             |       |none  |acc_norm|0.2765|±  |0.0175|
|piqa         |Yaml   |none  |acc     |0.6855|±  |0.0108|
|             |       |none  |acc_norm|0.6823|±  |0.0109|
|sciq         |Yaml   |none  |acc     |0.8050|±  |0.0125|
|             |       |none  |acc_norm|0.7080|±  |0.0144|
|winogrande   |Yaml   |none  |acc     |0.5335|±  |0.0140|
|wsc |Yaml   |none  |acc   |0.3654|±  |0.0474|
|lambada_openai|Yaml   |none  |perplexity|9.8265|±  |0.3139|
|              |       |none  |acc       |0.5135|±  |0.0070|