euclaise committed on
Commit
72ab9f1
1 Parent(s): fa76e5f

Update README.md

Files changed (1)
  1. README.md +2 -0
README.md CHANGED
@@ -31,3 +31,5 @@ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-le
31
  | Winogrande (5-shot) | 77.35 |
32
  | GSM8K (5-shot) | 5.76 |
33
  | DROP (3-shot) | 5.87 |
 
 
 
34
+
35
+ I'm not sure what's going on with GSM8K. Since GSM8K (train split) data was included in the Ferret dataset, I suspect that either the model is over-correcting itself or the eval is broken.