Adding Evaluation Results
#1
by
leaderboard-pr-bot
- opened
README.md
CHANGED
@@ -56,4 +56,17 @@ The following hyperparameters were used during training:
|
|
56 |
- Transformers 4.40.2
|
57 |
- Pytorch 2.3.0+cu121
|
58 |
- Datasets 2.19.1
|
59 |
-
- Tokenizers 0.19.1
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
56 |
- Transformers 4.40.2
|
57 |
- Pytorch 2.3.0+cu121
|
58 |
- Datasets 2.19.1
|
59 |
+
- Tokenizers 0.19.1
|
60 |
+
# [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
|
61 |
+
Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_langgptai__qwen1.5-7b-chat-sa-v0.1)
|
62 |
+
|
63 |
+
| Metric |Value|
|
64 |
+
|-------------------|----:|
|
65 |
+
|Avg. |16.58|
|
66 |
+
|IFEval (0-Shot) |42.68|
|
67 |
+
|BBH (3-Shot) |20.30|
|
68 |
+
|MATH Lvl 5 (4-Shot)| 3.02|
|
69 |
+
|GPQA (0-shot) | 8.28|
|
70 |
+
|MuSR (0-shot) | 3.06|
|
71 |
+
|MMLU-PRO (5-shot) |22.14|
|
72 |
+
|