MaziyarPanahi committed
Commit • daff11e
1 Parent(s): 5ac1cdf
Adding Evaluation Results
This is an automated PR created with https://huggingface.co/spaces/Weyaxi/open-llm-leaderboard-results-pr
The purpose of this PR is to add evaluation results from the Open LLM Leaderboard to your model card.
If you encounter any issues, please report them to https://huggingface.co/spaces/Weyaxi/open-llm-leaderboard-results-pr/discussions
README.md
CHANGED
@@ -16,7 +16,6 @@ tags:
 base_model: meta-llama/Meta-Llama-3-70B-Instruct
 datasets:
 - argilla/ultrafeedback-binarized-preferences
-model_name: Llama-3-70B-Instruct-DPO-v0.4
 pipeline_tag: text-generation
 license_name: llama3
 license_link: LICENSE
@@ -273,3 +272,17 @@ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-le
 |Winogrande (5-shot)              |83.58|
 |GSM8k (5-shot)                   |87.34|
 
+
+# [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
+Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_MaziyarPanahi__Llama-3-70B-Instruct-DPO-v0.4)
+
+|             Metric              |Value|
+|---------------------------------|----:|
+|Avg.                             |78.89|
+|AI2 Reasoning Challenge (25-Shot)|72.61|
+|HellaSwag (10-Shot)              |86.03|
+|MMLU (5-Shot)                    |80.50|
+|TruthfulQA (0-shot)              |63.26|
+|Winogrande (5-shot)              |83.58|
+|GSM8k (5-shot)                   |87.34|
+
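Reviewer note: the `Avg.` row can be sanity-checked against the six benchmark rows. The sketch below assumes `Avg.` is the unweighted mean of the six scores rounded to two decimals; the PR text does not state this. The helper name `leaderboard_average` is hypothetical and is not part of the PR tooling.

```python
# Scores copied from the table added to README.md in this commit.
scores = {
    "AI2 Reasoning Challenge (25-Shot)": 72.61,
    "HellaSwag (10-Shot)": 86.03,
    "MMLU (5-Shot)": 80.50,
    "TruthfulQA (0-shot)": 63.26,
    "Winogrande (5-shot)": 83.58,
    "GSM8k (5-shot)": 87.34,
}


def leaderboard_average(values):
    """Unweighted mean rounded to two decimals.

    Assumption: this is how the reported Avg. is derived; the hypothetical
    helper only exists to check the table's arithmetic.
    """
    values = list(values)
    return round(sum(values) / len(values), 2)


average = leaderboard_average(scores.values())
print(f"Avg. = {average:.2f}")
```

Under that assumption the computed value agrees with the 78.89 reported in the table.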