File size: 1,084 Bytes
7542411
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
Wandb run: https://wandb.ai/eleutherai/pythia-rlhf/runs/7f9c9lrm

Eval Results
|    Tasks     |Version|Filter|  Metric  | Value |   |Stderr|
|--------------|-------|------|----------|------:|---|-----:|
|arc_challenge |Yaml   |none  |acc       | 0.2201|±  |0.0121|
|              |       |none  |acc_norm  | 0.2568|±  |0.0128|
|arc_easy      |Yaml   |none  |acc       | 0.5253|±  |0.0102|
|              |       |none  |acc_norm  | 0.4558|±  |0.0102|
|lambada_openai|Yaml   |none  |perplexity|11.3766|±  |0.3623|
|              |       |none  |acc       | 0.4844|±  |0.0070|
|logiqa        |Yaml   |none  |acc       | 0.2120|±  |0.0160|
|              |       |none  |acc_norm  | 0.2780|±  |0.0176|
|piqa          |Yaml   |none  |acc       | 0.6817|±  |0.0109|
|              |       |none  |acc_norm  | 0.6828|±  |0.0109|
|sciq          |Yaml   |none  |acc       | 0.8130|±  |0.0123|
|              |       |none  |acc_norm  | 0.7090|±  |0.0144|
|winogrande    |Yaml   |none  |acc       | 0.5375|±  |0.0140|
|wsc           |Yaml   |none  |acc       | 0.3654|±  |0.0474|