pankajmathur/orca_mini_v2_13b

Pankaj Mathur committed on Jul 9, 2023

Commit 1332e7f • 1 Parent(s): e918056

Update README.md

Files changed (1)

README.md +9 -10

README.md CHANGED

@@ -22,16 +22,15 @@ I evaluated orca_mini_v2_13b on a wide range of tasks using [Language Model Eval
 Here are the results on metrics used by [HuggingFaceH4 Open LLM Leaderboard](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
-please note num_fewshots varies for each below task as used by HuggingFaceH4 Open LLM Leaderboard
-|||||||
-|:------:|:-------------:|:---------:|:--------:|:-------:|:--------:|
-|**Task**|**num_fewshot**|**Version**|**Metric**|**Value**|**Stderr**|
-|*arc_challenge*|25|0|acc_norm|0.5572|0.0145|
-|*hellaswag*|10|0|acc_norm|0.7964|0.0040|
-|*mmlu*|5|0|acc_norm|0.4969|0.035|
-|*truthfulqa_mc*|0|1|mc2|0.5231|0.0158|
-|*Total Average*|0|-|-|0.5933|0.0114|

 Here are the results on metrics used by [HuggingFaceH4 Open LLM Leaderboard](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
+||||
+|:------:|:-------------:|:---------:|
+|**Task**|**Value**|**Stderr**|
+|*arc_challenge*|0.5572|0.0145|
+|*hellaswag*|0.7964|0.0040|
+|*mmlu*|0.4969|0.035|
+|*truthfulqa_mc*|0.5231|0.0158|
+|*Total Average*|0.5933|0.0114|
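
The *Total Average* row looks like the unweighted mean of the four task values, although the commit does not say how it was computed. A minimal sketch for rechecking it under that assumption is below; the `scores` dictionary and the `macro_average` helper are illustrative names, not part of the README. With the rounded values from the table, the mean works out to about 0.5934, so the reported 0.5933 may have been averaged from unrounded scores.

```python
# Task scores copied from the table above.
scores = {
    "arc_challenge": 0.5572,
    "hellaswag": 0.7964,
    "mmlu": 0.4969,
    "truthfulqa_mc": 0.5231,
}


def macro_average(values):
    """Unweighted mean of the task scores (assumed aggregation, not confirmed by the README)."""
    values = list(values)
    return sum(values) / len(values)


# Round to four decimals to match the precision used in the table.
average = round(macro_average(scores.values()), 4)
print(f"Average over {len(scores)} tasks: {average}")
```

The same helper can be pointed at the *Stderr* column, but averaging standard errors is only a rough summary and is not claimed here to be how the README's figure was derived.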