Add Model Evals
README.md
CHANGED
@@ -1 +1,16 @@
-wandb run: https://wandb.ai/eleutherai/pythia-rlhf/runs/e0drjcsz?workspace=user-yongzx
+wandb run: https://wandb.ai/eleutherai/pythia-rlhf/runs/e0drjcsz?workspace=user-yongzx
+
+Model Evals:
+|    Task     |Version|Filter| Metric |Value |   |Stderr|
+|-------------|-------|------|--------|-----:|---|-----:|
+|arc_challenge|Yaml   |none  |acc     |0.1877|±  |0.0114|
+|             |       |none  |acc_norm|0.2372|±  |0.0124|
+|arc_easy     |Yaml   |none  |acc     |0.4390|±  |0.0102|
+|             |       |none  |acc_norm|0.4082|±  |0.0101|
+|logiqa       |Yaml   |none  |acc     |0.1889|±  |0.0154|
+|             |       |none  |acc_norm|0.2473|±  |0.0169|
+|piqa         |Yaml   |none  |acc     |0.6213|±  |0.0113|
+|             |       |none  |acc_norm|0.6279|±  |0.0113|
+|sciq         |Yaml   |none  |acc     |0.7230|±  |0.0142|
+|             |       |none  |acc_norm|0.6840|±  |0.0147|
+|winogrande   |Yaml   |none  |acc     |0.5162|±  |0.0140|
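
To read the Value and Stderr columns together, here is a minimal sketch that parses the table added in this diff and prints an approximate 95% interval for each metric. It assumes a normal approximation (value ± 1.96 × stderr). The names `parse_results` and `interval` are hypothetical helpers written for this illustration and are not part of the evaluation tooling.

```python
# Table rows copied from the README diff above (without the leading "+").
TABLE = """\
|    Task     |Version|Filter| Metric |Value |   |Stderr|
|-------------|-------|------|--------|-----:|---|-----:|
|arc_challenge|Yaml   |none  |acc     |0.1877|±  |0.0114|
|             |       |none  |acc_norm|0.2372|±  |0.0124|
|arc_easy     |Yaml   |none  |acc     |0.4390|±  |0.0102|
|             |       |none  |acc_norm|0.4082|±  |0.0101|
|logiqa       |Yaml   |none  |acc     |0.1889|±  |0.0154|
|             |       |none  |acc_norm|0.2473|±  |0.0169|
|piqa         |Yaml   |none  |acc     |0.6213|±  |0.0113|
|             |       |none  |acc_norm|0.6279|±  |0.0113|
|sciq         |Yaml   |none  |acc     |0.7230|±  |0.0142|
|             |       |none  |acc_norm|0.6840|±  |0.0147|
|winogrande   |Yaml   |none  |acc     |0.5162|±  |0.0140|
"""


def parse_results(table):
    """Return a list of (task, metric, value, stderr) tuples.

    Rows with an empty Task cell inherit the task name from the row above.
    """
    rows = []
    task = None
    for line in table.strip().splitlines():
        cells = [c.strip() for c in line.strip().strip("|").split("|")]
        # Skip the header row, the separator row, and anything malformed.
        if len(cells) != 7 or cells[0] == "Task" or cells[0].startswith("-"):
            continue
        task = cells[0] or task
        rows.append((task, cells[3], float(cells[4]), float(cells[6])))
    return rows


def interval(value, stderr, z=1.96):
    """Normal-approximation interval; z=1.96 is an assumed 95% level."""
    return value - z * stderr, value + z * stderr


if __name__ == "__main__":
    for task, metric, value, stderr in parse_results(TABLE):
        low, high = interval(value, stderr)
        print(f"{task:<14}{metric:<9}{value:.4f}  [{low:.4f}, {high:.4f}]")
```

Under this approximation, the winogrande interval runs from about 0.489 to 0.544, so it includes 0.5.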