Wandb run: https://wandb.ai/eleutherai/pythia-rlhf/runs/7f9c9lrm
Eval Results
| Tasks |Version|Filter| Metric | Value | |Stderr|
|--------------|-------|------|----------|------:|---|-----:|
|arc_challenge |Yaml |none |acc | 0.2201|± |0.0121|
| | |none |acc_norm | 0.2568|± |0.0128|
|arc_easy |Yaml |none |acc | 0.5253|± |0.0102|
| | |none |acc_norm | 0.4558|± |0.0102|
|lambada_openai|Yaml |none |perplexity|11.3766|± |0.3623|
| | |none |acc | 0.4844|± |0.0070|
|logiqa |Yaml |none |acc | 0.2120|± |0.0160|
| | |none |acc_norm | 0.2780|± |0.0176|
|piqa |Yaml |none |acc | 0.6817|± |0.0109|
| | |none |acc_norm | 0.6828|± |0.0109|
|sciq |Yaml |none |acc | 0.8130|± |0.0123|
| | |none |acc_norm | 0.7090|± |0.0144|
|winogrande |Yaml |none |acc | 0.5375|± |0.0140|
|wsc |Yaml |none |acc | 0.3654|± |0.0474|
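
The `Stderr` column can be turned into approximate confidence intervals when comparing these numbers against other runs. The following is a minimal sketch, assuming a normal approximation with z = 1.96 for a roughly 95% interval; the names `RESULTS` and `confidence_interval` are hypothetical helpers, not part of any evaluation library, and only a subset of the rows is copied in.

```python
# Reported (value, stderr) pairs copied from a subset of the table rows.
RESULTS = {
    ("arc_challenge", "acc"): (0.2201, 0.0121),
    ("arc_easy", "acc"): (0.5253, 0.0102),
    ("lambada_openai", "acc"): (0.4844, 0.0070),
    ("piqa", "acc"): (0.6817, 0.0109),
    ("sciq", "acc"): (0.8130, 0.0123),
    ("winogrande", "acc"): (0.5375, 0.0140),
    ("wsc", "acc"): (0.3654, 0.0474),
}


def confidence_interval(value, stderr, z=1.96):
    """Return (lower, upper) bounds of a normal-approximation interval.

    Assumption: the reported stderr is a standard error of the mean, so
    value +/- z * stderr approximates a two-sided interval.
    """
    margin = z * stderr
    return value - margin, value + margin


if __name__ == "__main__":
    for (task, metric), (value, stderr) in RESULTS.items():
        lower, upper = confidence_interval(value, stderr)
        print(f"{task:15s} {metric:5s} {value:.4f}  [{lower:.4f}, {upper:.4f}]")
```

Under this approximation, `wsc` has by far the widest interval of the rows above, since its reported standard error (0.0474) is several times larger than the others.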