Update README.md #11 by MaziyarPanahi - opened
README.md CHANGED
@@ -244,6 +244,21 @@ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-le
244 |
245 | ![image/png](https://cdn-uploads.huggingface.co/production/uploads/5fd5e18a90b6dc4633f6d292/ExIVXtyzYIYgilY_MxAPY.png)
246 |
247 | # Prompt Template
248 |
249 | This model uses `ChatML` prompt template:
@@ -320,16 +335,4 @@ outputs = pipeline(
320 | print(outputs[0]["generated_text"][len(prompt):])
321 | ```
322 |
323 | - # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
324 | - Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_MaziyarPanahi__Llama-3-8B-Instruct-v0.8)
325 | -
326 | - | Metric |Value|
327 | - |-------------------|----:|
328 | - |Avg. |26.75|
329 | - |IFEval (0-Shot) |75.12|
330 | - |BBH (3-Shot) |28.27|
331 | - |MATH Lvl 5 (4-Shot)| 7.10|
332 | - |GPQA (0-shot) | 7.38|
333 | - |MuSR (0-shot) |10.92|
334 | - |MMLU-PRO (5-shot) |31.68|
335 |
244 |
245 | ![image/png](https://cdn-uploads.huggingface.co/production/uploads/5fd5e18a90b6dc4633f6d292/ExIVXtyzYIYgilY_MxAPY.png)
246 |
247 | +
248 | + **Leaderboard 2.0:**
249 | +
250 | + | Metric |Value|
251 | + |-------------------|----:|
252 | + |Avg. |26.75|
253 | + |IFEval (0-Shot) |75.12|
254 | + |BBH (3-Shot) |28.27|
255 | + |MATH Lvl 5 (4-Shot)| 7.10|
256 | + |GPQA (0-shot) | 7.38|
257 | + |MuSR (0-shot) |10.92|
258 | + |MMLU-PRO (5-shot) |31.68|
259 | +
260 | +
261 | +
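The `Avg.` row in the added table is consistent with an unweighted mean of the six benchmark scores. A minimal Python check, assuming a plain arithmetic mean (the README does not state how the leaderboard aggregates, so the helper `mean_score` is illustrative only):

```python
# Scores copied from the "Leaderboard 2.0" table in this diff.
scores = {
    "IFEval (0-Shot)": 75.12,
    "BBH (3-Shot)": 28.27,
    "MATH Lvl 5 (4-Shot)": 7.10,
    "GPQA (0-shot)": 7.38,
    "MuSR (0-shot)": 10.92,
    "MMLU-PRO (5-shot)": 31.68,
}


def mean_score(values):
    """Unweighted arithmetic mean (an assumption about how Avg. is derived)."""
    values = list(values)
    return sum(values) / len(values)


avg = mean_score(scores.values())
print(f"Mean of {len(scores)} benchmarks: {avg:.3f}")
```

Under that assumption the computed mean lands within rounding of the reported 26.75.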
262 | # Prompt Template
263 |
264 | This model uses `ChatML` prompt template:
335 | print(outputs[0]["generated_text"][len(prompt):])
336 | ```
337 |
338 |