Commit 965ed2a
Author: Pankaj Mathur
Parent(s): 36aabdb

Update README.md
README.md
CHANGED
@@ -26,11 +26,11 @@ Here are the results on metrics used by [HuggingFaceH4 Open LLM Leaderboard](htt
 |||||
 |:------:|:-------------:|:-------------:|:---------:|
 |**Task**|**Metric**|**Value**|**Stderr**|
-|*arc_challenge*|acc_norm|0.
-|*hellaswag*|acc_norm|0.
+|*arc_challenge*|acc_norm|0.5478|0.0145|
+|*hellaswag*|acc_norm|0.7023|0.0040|
 |*mmlu*|acc_norm|0.4969|0.035|
-|*truthfulqa_mc*|mc2|0.
-|*Total Average*|acc_norm|0.
+|*truthfulqa_mc*|mc2|0.44|0.0158|
+|*Total Average*|acc_norm|0.54675|0.0114|
 
 
 
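The new *Total Average* value appears to be the unweighted arithmetic mean of the four task scores in the updated rows. The following is a minimal sketch of that check, not the author's evaluation code. The variable names are illustrative, and the sketch does not attempt to reproduce the Stderr column.

```python
from statistics import mean

# Values copied from the added (+) rows of the table in this commit.
scores = {
    "arc_challenge": 0.5478,   # acc_norm
    "hellaswag": 0.7023,       # acc_norm
    "mmlu": 0.4969,            # acc_norm (unchanged row)
    "truthfulqa_mc": 0.44,     # mc2
}

# Assumption: "Total Average" is a plain mean over the four tasks.
total_average = mean(list(scores.values()))

# Compare with the 0.54675 reported in the table.
print(round(total_average, 5))
```

Note that the average mixes two metrics (`acc_norm` and `mc2`), so the `acc_norm` label on the *Total Average* row should be read loosely.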