pythia-70m-ppo / README.md
usvsnsp's picture
Create README.md
978cc7b

Wandb Run: https://wandb.ai/eleutherai/pythia-rlhf/runs/gy2g8jj1

Model Evals:

Tasks Version Filter Metric Value Stderr
arc_challenge Yaml none acc 0.2253 ± 0.0122
none acc_norm 0.2278 ± 0.0123
arc_easy Yaml none acc 0.2551 ± 0.0089
none acc_norm 0.2567 ± 0.0090
lambada_openai Yaml none perplexity NaN ± NaN
none acc 0.0016 ± 0.0005
logiqa Yaml none acc 0.2028 ± 0.0158
none acc_norm 0.2028 ± 0.0158
piqa Yaml none acc 0.4946 ± 0.0117
none acc_norm 0.4924 ± 0.0117
sciq Yaml none acc 0.0140 ± 0.0037
none acc_norm 0.0140 ± 0.0037
winogrande Yaml none acc 0.5036 ± 0.0141
wsc Yaml none acc 0.6346 ± 0.0474