Text Generation · Transformers · PyTorch · English · llama · Eval Results · text-generation-inference · Inference Endpoints
Pankaj Mathur committed
Commit 8aded8e
1 Parent(s): 0c3d4df

Update README.md

Files changed (1)
  1. README.md +7 -7
README.md CHANGED
@@ -26,14 +26,10 @@ Here are the zero shot metrics results.
 |:------:|:-------------:|:---------:|:--------:|:-------:|:--------:|
 |**Task**|**num_fewshot**|**Version**|**Metric**|**Value**|**Stderr**|
 |*arc_easy*|0|0|acc|0.7386|0.0090|
-|*arc_easy*|0|0|acc_norm|0.7066|0.0093|
-|*hellaswag*|0|0|acc|0.5591|0.0050|
 |*hellaswag*|0|0|acc_norm|0.7394|0.0044|
-|*truthfulqa_mc*|0|1|mc1|0.2938|0.0159|
 |*truthfulqa_mc*|0|1|mc2|0.4399|0.0153|
-|*mmlu avg*|0|1|acc|0.4108|0.0153|
-|*mmlu avg*|0|1|acc_norm|0.4108|0.0153|
-|*Total Zero Shot Average*|0|-|-|0.5373|0.011|
+|*mmlu*|0|1|acc_norm|0.4108|0.0153|
+|*Total Zero Shot Average*|0|-|-|0.5821|0.011|
 
 
 Here are the results on metrics used by [HuggingFaceH4 Open LLM Leaderboard](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
@@ -43,8 +39,12 @@ please note num_fewshots varies for each below task as used by HuggingFaceH4 Open LLM Leaderboard
 |||||||
 |:------:|:-------------:|:---------:|:--------:|:-------:|:--------:|
 |**Task**|**num_fewshot**|**Version**|**Metric**|**Value**|**Stderr**|
-|*arc_challenge*|25|0|acc|0.4846|0.0146|
 |*arc_challenge*|25|0|acc_norm|0.5077|0.0146|
+|*hellaswag*|10|0|acc_norm|0.7617|0.0043|
+|*mmlu*|5|0|acc_norm|-|-|
+|*truthfulqa_mc*|0|1|mc2|0.4399|0.0153|
+|*Total Average*|0|-|-|0.5697|0.0114|
+
 
 
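
For readers checking the updated totals against the per-task rows, here is a quick sketch, assuming each "Total" row is a plain unweighted mean of the task scores listed above it and that the mmlu row reported as "-" is simply left out of the leaderboard-style average (neither assumption is stated in the diff itself):

```python
# Sketch only: assumes the "Total" rows are unweighted means of the per-task
# scores shown in the updated tables (an assumption, not stated in the commit).

zero_shot = [0.7386, 0.7394, 0.4399, 0.4108]    # arc_easy, hellaswag, truthfulqa_mc, mmlu
print(round(sum(zero_shot) / len(zero_shot), 4))       # 0.5822, vs. 0.5821 reported

# Leaderboard-style table: mmlu is "-" in the diff, so it is excluded here.
leaderboard = [0.5077, 0.7617, 0.4399]          # arc_challenge, hellaswag, truthfulqa_mc
print(round(sum(leaderboard) / len(leaderboard), 4))   # 0.5698, vs. 0.5697 reported
```

The small last-digit differences are consistent with the reported totals being truncated rather than rounded.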
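
The task names, versions, and per-task num_fewshot values in these tables follow EleutherAI's lm-evaluation-harness. Below is a hedged sketch, not the author's actual command, of how the leaderboard-style settings might be rerun with a 0.3-era harness (whose task names, like `truthfulqa_mc`, match the tables); `psmathur/your-model` is a placeholder id, and the exact harness revision and model arguments behind these numbers are not given in the diff.

```python
# Hedged sketch: rerunning the leaderboard-style settings with EleutherAI's
# lm-evaluation-harness. API details vary by release; this follows the
# 0.3-era interface. "psmathur/your-model" is a placeholder, not the repo
# this commit belongs to.
from lm_eval import evaluator

FEWSHOT = {            # per-task num_fewshot, as listed in the second table
    "arc_challenge": 25,
    "hellaswag": 10,
    "truthfulqa_mc": 0,
}

for task, shots in FEWSHOT.items():
    results = evaluator.simple_evaluate(
        model="hf-causal",                         # Hugging Face causal-LM backend
        model_args="pretrained=psmathur/your-model",
        tasks=[task],
        num_fewshot=shots,
    )
    print(task, results["results"][task])
```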