Add Evaluation Results

#1
by usvsnsp - opened
Files changed (1)
  1. README.md +19 -1
README.md CHANGED
@@ -1 +1,19 @@
- Wandb runs: https://wandb.ai/eleutherai/pythia-rlhf/runs/s0qdwbg6?workspace=user-yongzx
+ Wandb runs: https://wandb.ai/eleutherai/pythia-rlhf/runs/s0qdwbg6?workspace=user-yongzx
+
+
+ Evaluation results:
+ |     Task     |Version|Filter|  Metric  | Value  |   | Stderr|
+ |--------------|-------|------|----------|-------:|---|------:|
+ |arc_challenge |Yaml   |none  |acc       |  0.1758|±  | 0.0111|
+ |              |       |none  |acc_norm  |  0.2176|±  | 0.0121|
+ |arc_easy      |Yaml   |none  |acc       |  0.3742|±  | 0.0099|
+ |              |       |none  |acc_norm  |  0.3565|±  | 0.0098|
+ |logiqa        |Yaml   |none  |acc       |  0.2058|±  | 0.0159|
+ |              |       |none  |acc_norm  |  0.2412|±  | 0.0168|
+ |piqa          |Yaml   |none  |acc       |  0.5958|±  | 0.0114|
+ |              |       |none  |acc_norm  |  0.5941|±  | 0.0115|
+ |sciq          |Yaml   |none  |acc       |  0.5930|±  | 0.0155|
+ |              |       |none  |acc_norm  |  0.5720|±  | 0.0157|
+ |winogrande    |Yaml   |none  |acc       |  0.5154|±  | 0.0140|
+ |lambada_openai|Yaml   |none  |perplexity|730.2552|±  |46.8739|
+ |              |       |none  |acc       |  0.1316|±  | 0.0047|
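
Reading note on the table above: the last column is the standard error of each metric. One rough way to interpret it is a normal-approximation 95% interval, value ± 1.96 × stderr. The sketch below assumes that approximation is acceptable for these sample sizes. The helper name `confidence_interval` and the `results` dictionary are hypothetical and are not part of the PR or of any evaluation tool. The numbers are copied from three rows of the table.

```python
from typing import Dict, Tuple


def confidence_interval(value: float, stderr: float, z: float = 1.96) -> Tuple[float, float]:
    """Return (low, high) for value +/- z * stderr (normal approximation)."""
    margin = z * stderr
    return value - margin, value + margin


# (task, metric) -> (value, stderr), copied from the table above.
results: Dict[Tuple[str, str], Tuple[float, float]] = {
    ("arc_challenge", "acc"): (0.1758, 0.0111),
    ("piqa", "acc"): (0.5958, 0.0114),
    ("winogrande", "acc"): (0.5154, 0.0140),
}

for (task, metric), (value, stderr) in results.items():
    low, high = confidence_interval(value, stderr)
    print(f"{task} {metric}: {value:.4f} (approx. 95% interval {low:.4f} to {high:.4f})")
```

Under this approximation the winogrande interval spans 0.5, the chance level for a two-option task, so that row alone does not separate the model from random guessing.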