leaderboard-pr-bot
commited on
Commit
•
5310f49
1
Parent(s):
dd34258
Adding Evaluation Results
Browse filesThis is an automated PR created with https://huggingface.co/spaces/Weyaxi/open-llm-leaderboard-results-pr
The purpose of this PR is to add evaluation results from the Open LLM Leaderboard to your model card.
If you encounter any issues, please report them to https://huggingface.co/spaces/Weyaxi/open-llm-leaderboard-results-pr/discussions
README.md
CHANGED
@@ -1,19 +1,19 @@
|
|
1 |
-
---
|
2 |
-
|
3 |
-
|
4 |
-
|
5 |
-
|
6 |
-
-
|
7 |
-
-
|
8 |
-
|
9 |
-
-
|
10 |
-
|
11 |
-
|
12 |
-
|
13 |
-
|
14 |
-
|
15 |
-
|
16 |
-
---
|
17 |
|
18 |
![Magpie](https://cdn-uploads.huggingface.co/production/uploads/653df1323479e9ebbe3eb6cc/FWWILXrAGNwWr52aghV0S.png)
|
19 |
## 🔥 Chat with Magpie [Here](https://huggingface.co/spaces/flydust/Chat-with-Magpie)!
|
@@ -200,4 +200,17 @@ ArmoRM paper:
|
|
200 |
}
|
201 |
```
|
202 |
|
203 |
-
**Questions?** Please contact [Zhangchen](https://zhangchenxu.com/) by email.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
language:
|
3 |
+
- en
|
4 |
+
license: llama3.1
|
5 |
+
tags:
|
6 |
+
- alignment-handbook
|
7 |
+
- trl
|
8 |
+
- dpo
|
9 |
+
- generated_from_trainer
|
10 |
+
base_model: Magpie-Align/Llama-3.1-8B-Magpie-Align-SFT-v0.1
|
11 |
+
datasets:
|
12 |
+
- princeton-nlp/llama3-ultrafeedback-armorm
|
13 |
+
model-index:
|
14 |
+
- name: Llama-3.1-8B-Magpie-Align-v0.1
|
15 |
+
results: []
|
16 |
+
---
|
17 |
|
18 |
![Magpie](https://cdn-uploads.huggingface.co/production/uploads/653df1323479e9ebbe3eb6cc/FWWILXrAGNwWr52aghV0S.png)
|
19 |
## 🔥 Chat with Magpie [Here](https://huggingface.co/spaces/flydust/Chat-with-Magpie)!
|
|
|
200 |
}
|
201 |
```
|
202 |
|
203 |
+
**Questions?** Please contact [Zhangchen](https://zhangchenxu.com/) by email.
|
204 |
+
# [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
|
205 |
+
Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_Magpie-Align__Llama-3.1-8B-Magpie-Align-v0.1)
|
206 |
+
|
207 |
+
| Metric |Value|
|
208 |
+
|-------------------|----:|
|
209 |
+
|Avg. |16.44|
|
210 |
+
|IFEval (0-Shot) |44.58|
|
211 |
+
|BBH (3-Shot) |24.04|
|
212 |
+
|MATH Lvl 5 (4-Shot)| 0.00|
|
213 |
+
|GPQA (0-shot) | 1.79|
|
214 |
+
|MuSR (0-shot) | 3.09|
|
215 |
+
|MMLU-PRO (5-shot) |25.13|
|
216 |
+
|