Adding Evaluation Results
#4
by
leaderboard-pr-bot
- opened
README.md
CHANGED
@@ -9,12 +9,12 @@ tags:
|
|
9 |
- qwen
|
10 |
- moe
|
11 |
base_model: Qwen/Qwen1.5-MoE-A2.7B
|
12 |
-
model-index:
|
13 |
-
- name: models/Qwen1.5-MoE-A2.7B-Wikihow
|
14 |
-
results: []
|
15 |
datasets:
|
16 |
- HuggingFaceTB/cosmopedia
|
17 |
pipeline_tag: text-generation
|
|
|
|
|
|
|
18 |
---
|
19 |
|
20 |
# models/Qwen1.5-MoE-A2.7B-Wikihow
|
@@ -156,4 +156,17 @@ special_tokens:
|
|
156 |
- Transformers 4.40.0.dev0
|
157 |
- Pytorch 2.2.0+cu121
|
158 |
- Datasets 2.18.0
|
159 |
-
- Tokenizers 0.15.2
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
9 |
- qwen
|
10 |
- moe
|
11 |
base_model: Qwen/Qwen1.5-MoE-A2.7B
|
|
|
|
|
|
|
12 |
datasets:
|
13 |
- HuggingFaceTB/cosmopedia
|
14 |
pipeline_tag: text-generation
|
15 |
+
model-index:
|
16 |
+
- name: models/Qwen1.5-MoE-A2.7B-Wikihow
|
17 |
+
results: []
|
18 |
---
|
19 |
|
20 |
# models/Qwen1.5-MoE-A2.7B-Wikihow
|
|
|
156 |
- Transformers 4.40.0.dev0
|
157 |
- Pytorch 2.2.0+cu121
|
158 |
- Datasets 2.18.0
|
159 |
+
- Tokenizers 0.15.2
|
160 |
+
# [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
|
161 |
+
Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_MaziyarPanahi__Qwen1.5-MoE-A2.7B-Wikihow)
|
162 |
+
|
163 |
+
| Metric |Value|
|
164 |
+
|-------------------|----:|
|
165 |
+
|Avg. |11.43|
|
166 |
+
|IFEval (0-Shot) |29.54|
|
167 |
+
|BBH (3-Shot) |15.47|
|
168 |
+
|MATH Lvl 5 (4-Shot)| 2.87|
|
169 |
+
|GPQA (0-shot) | 3.36|
|
170 |
+
|MuSR (0-shot) | 2.01|
|
171 |
+
|MMLU-PRO (5-shot) |15.34|
|
172 |
+
|