Pankaj Mathur commited on
Commit
1332e7f
1 Parent(s): e918056

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +9 -10
README.md CHANGED
@@ -22,16 +22,15 @@ I evaluated orca_mini_v2_13b on a wide range of tasks using [Language Model Eval
22
 
23
  Here are the results on metrics used by [HuggingFaceH4 Open LLM Leaderboard](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
24
 
25
- please note num_fewshots varies for each below task as used by HuggingFaceH4 Open LLM Leaderboard
26
-
27
- |||||||
28
- |:------:|:-------------:|:---------:|:--------:|:-------:|:--------:|
29
- |**Task**|**num_fewshot**|**Version**|**Metric**|**Value**|**Stderr**|
30
- |*arc_challenge*|25|0|acc_norm|0.5572|0.0145|
31
- |*hellaswag*|10|0|acc_norm|0.7964|0.0040|
32
- |*mmlu*|5|0|acc_norm|0.4969|0.035|
33
- |*truthfulqa_mc*|0|1|mc2|0.5231|0.0158|
34
- |*Total Average*|0|-|-|0.5933|0.0114|
35
 
36
 
37
 
 
22
 
23
  Here are the results on metrics used by [HuggingFaceH4 Open LLM Leaderboard](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
24
 
25
+
26
+ ||||
27
+ |:------:|:-------------:|:---------:|
28
+ |**Task**|**Value**|**Stderr**|
29
+ |*arc_challenge*|0.5572|0.0145|
30
+ |*hellaswag*|0.7964|0.0040|
31
+ |*mmlu*|0.4969|0.035|
32
+ |*truthfulqa_mc*|0.5231|0.0158|
33
+ |*Total Average*|0.5933|0.0114|
 
34
 
35
 
36