Commit
ed7d96e
1 Parent(s): 658350b

Adding Evaluation Results (#2)

Browse files

- Adding Evaluation Results (0347bce4e230c88d7d2c51e94c6aa92c10058761)


Co-authored-by: Open LLM Leaderboard PR Bot <leaderboard-pr-bot@users.noreply.huggingface.co>

Files changed (1) hide show
  1. README.md +14 -0
README.md CHANGED
@@ -200,6 +200,20 @@ Model evaluation on OpenLLM LeaderBoard
200
 
201
 
202
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
203
  # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
204
  Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_manishiitg__open-aditi-hi-v4)
205
 
 
200
 
201
 
202
 
203
+ # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
204
+ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_manishiitg__open-aditi-hi-v4)
205
+
206
+ | Metric |Value|
207
+ |---------------------------------|----:|
208
+ |Avg. |64.23|
209
+ |AI2 Reasoning Challenge (25-Shot)|60.15|
210
+ |HellaSwag (10-Shot) |81.84|
211
+ |MMLU (5-Shot) |61.32|
212
+ |TruthfulQA (0-shot) |44.89|
213
+ |Winogrande (5-shot) |79.95|
214
+ |GSM8k (5-shot) |57.24|
215
+
216
+
217
  # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
218
  Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_manishiitg__open-aditi-hi-v4)
219