MaziyarPanahi committed on
Commit b89671d
1 Parent(s): 9e4bf00
Update README.md (#14)
- Update README.md (c3c1a8309ac93fbdef3a4ae1bcd2ba30cd4a5093)
README.md CHANGED
@@ -233,6 +233,17 @@ All GGUF models are available here: [MaziyarPanahi/Llama-3-70B-Instruct-DPO-v0.4
 # 🏆 [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
 Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_MaziyarPanahi__Llama-3-70B-Instruct-DPO-v0.4)
 
+| Metric |Value|
+|-------------------|----:|
+|Avg. |32.18|
+|IFEval (0-Shot) |50.27|
+|BBH (3-Shot) |48.40|
+|MATH Lvl 5 (4-Shot)|22.66|
+|GPQA (0-shot) |11.97|
+|MuSR (0-shot) |13.10|
+|MMLU-PRO (5-shot) |46.71|
+
+
 | Metric |Value|
 |---------------------------------|----:|
 |Avg. |78.89|
@@ -369,16 +380,3 @@ Here are the pros and cons of the Docker system:
 Overall, Docker provides a powerful and flexible way to deploy and manage applications, but it requires careful planning, configuration, and management to ensure optimal performance and security.
 ```
 
-# [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
-Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_MaziyarPanahi__Llama-3-70B-Instruct-DPO-v0.4)
-
-| Metric |Value|
-|-------------------|----:|
-|Avg. |32.18|
-|IFEval (0-Shot) |50.27|
-|BBH (3-Shot) |48.40|
-|MATH Lvl 5 (4-Shot)|22.66|
-|GPQA (0-shot) |11.97|
-|MuSR (0-shot) |13.10|
-|MMLU-PRO (5-shot) |46.71|
-