usvsnsp commited on
Commit
3df871a
1 Parent(s): da0e1bc

Add Evaluation Results

Browse files
Files changed (1) hide show
  1. README.md +17 -1
README.md CHANGED
@@ -1 +1,17 @@
1
- Wandb runs: https://wandb.ai/eleutherai/pythia-rlhf/runs/s0qdwbg6?workspace=user-yongzx
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ Wandb runs: https://wandb.ai/eleutherai/pythia-rlhf/runs/s0qdwbg6?workspace=user-yongzx
2
+
3
+
4
+ Evaluation results:
5
+ | Task |Version|Filter| Metric |Value | |Stderr|
6
+ |-------------|-------|------|--------|-----:|---|-----:|
7
+ |arc_challenge|Yaml |none |acc |0.1758|± |0.0111|
8
+ | | |none |acc_norm|0.2176|± |0.0121|
9
+ |arc_easy |Yaml |none |acc |0.3742|± |0.0099|
10
+ | | |none |acc_norm|0.3565|± |0.0098|
11
+ |logiqa |Yaml |none |acc |0.2058|± |0.0159|
12
+ | | |none |acc_norm|0.2412|± |0.0168|
13
+ |piqa |Yaml |none |acc |0.5958|± |0.0114|
14
+ | | |none |acc_norm|0.5941|± |0.0115|
15
+ |sciq |Yaml |none |acc |0.5930|± |0.0155|
16
+ | | |none |acc_norm|0.5720|± |0.0157|
17
+ |winogrande |Yaml |none |acc |0.5154|± |0.0140|