leaderboard-pr-bot
commited on
Commit
•
e897790
1
Parent(s):
5af92a0
Adding Evaluation Results
Browse filesThis is an automated PR created with https://huggingface.co/spaces/Weyaxi/open-llm-leaderboard-results-pr
The purpose of this PR is to add evaluation results from the Open LLM Leaderboard to your model card.
If you encounter any issues, please report them to https://huggingface.co/spaces/Weyaxi/open-llm-leaderboard-results-pr/discussions
README.md
CHANGED
@@ -313,3 +313,17 @@ Furthermore, some aspects of string theory suggest that the fundamental constitu
|
|
313 |
In summary, while there is no direct connection between plasma propulsion systems and string theory, there is an indirect connection through the use of the equations of classical electromagnetism, which are also used in string theory. Additionally, some aspects of string theory suggest that the fundamental constituents of matter may have additional properties beyond those described by classical physics.
|
314 |
```
|
315 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
313 |
In summary, while there is no direct connection between plasma propulsion systems and string theory, there is an indirect connection through the use of the equations of classical electromagnetism, which are also used in string theory. Additionally, some aspects of string theory suggest that the fundamental constituents of matter may have additional properties beyond those described by classical physics.
|
314 |
```
|
315 |
|
316 |
+
|
317 |
+
# [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
|
318 |
+
Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_migtissera__Synthia-70B-v1.2b)
|
319 |
+
|
320 |
+
| Metric | Value |
|
321 |
+
|-----------------------|---------------------------|
|
322 |
+
| Avg. | 64.63 |
|
323 |
+
| ARC (25-shot) | 68.77 |
|
324 |
+
| HellaSwag (10-shot) | 87.57 |
|
325 |
+
| MMLU (5-shot) | 68.81 |
|
326 |
+
| TruthfulQA (0-shot) | 57.69 |
|
327 |
+
| Winogrande (5-shot) | 83.9 |
|
328 |
+
| GSM8K (5-shot) | 35.25 |
|
329 |
+
| DROP (3-shot) | 50.41 |
|