File size: 1,072 Bytes
5956541
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
c3bb61f
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
wandb: https://wandb.ai/eleutherai/pythia-rlhf/runs/8p0wfi7m?workspace=user-yongzx

Model Evals:
|     Task     |Version|Filter|  Metric  |Value |   |Stderr|
|--------------|-------|------|----------|-----:|---|-----:|
|arc_challenge |Yaml   |none  |acc       |0.2654|±  |0.0129|
|              |       |none  |acc_norm  |0.2875|±  |0.0132|
|arc_easy      |Yaml   |none  |acc       |0.6149|±  |0.0100|
|              |       |none  |acc_norm  |0.5391|±  |0.0102|
|lambada_openai|Yaml   |none  |perplexity|5.6120|±  |0.1509|
|              |       |none  |acc       |0.6146|±  |0.0068|
|logiqa        |Yaml   |none  |acc       |0.1951|±  |0.0155|
|              |       |none  |acc_norm  |0.2796|±  |0.0176|
|piqa          |Yaml   |none  |acc       |0.7160|±  |0.0105|
|              |       |none  |acc_norm  |0.7182|±  |0.0105|
|sciq          |Yaml   |none  |acc       |0.8610|±  |0.0109|
|              |       |none  |acc_norm  |0.7930|±  |0.0128|
|winogrande    |Yaml   |none  |acc       |0.5754|±  |0.0139|
|wsc |Yaml   |none  |acc   |0.3654|±  |0.0474|