giraffe176
commited on
Commit
•
4e4c381
1
Parent(s):
ff59f8b
Update README.md
Browse files
README.md
CHANGED
@@ -198,12 +198,10 @@ Scores 65.56 on [EQ-Bench v2](https://arxiv.org/abs/2312.06281)
|
|
198 |
### [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
|
199 |
Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_giraffe176__WestLake_Noromaid_OpenHermes_neural-chatv0.1)
|
200 |
|
201 |
-
|
|
202 |
-
|
203 |
-
|
|
204 |
-
|
|
205 |
-
|
|
206 |
-
|
|
207 |
-
|
|
208 |
-
|Winogrande (5-shot) |79.72|
|
209 |
-
|GSM8k (5-shot) |65.20|
|
|
|
198 |
### [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
|
199 |
Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_giraffe176__WestLake_Noromaid_OpenHermes_neural-chatv0.1)
|
200 |
|
201 |
+
| | Avg. | AI2 (25-Shot) | HellaSwag (10-Shot) | MMLU (5-Shot) | TruthfulQA (0-shot) | Winogrande (5-shot) | GSM8k (5-shot) |
|
202 |
+
|:-----------------------------------------:|-------|-----------------------------------|---------------------|---------------|---------------------|---------------------|----------------|
|
203 |
+
| This model | 68.86 | 66.72 | 85.37 | 64.67 | 51.50 | 79.72 | 65.20 |
|
204 |
+
| cognitivecomputations/WestLake-7B-v2-laser| **74.78** | 73.29 | **88.66** | **64.72** | **67.04** | **86.74** | **68.23** |
|
205 |
+
| NeverSleep/Noromaid-7B-0.4-DPO | 59.08 | 62.29 | 84.32 | 63.2 | 42.28 | 76.95 | 25.47 |
|
206 |
+
| teknium/OpenHermes-2.5-Mistral-7B | 61.52 | 64.93 | 84.18 | 63.64 | 52.24 | 78.06 | 26.08 |
|
207 |
+
| Intel/neural-chat-7b-v3-3 | 69.83 | **66.89** | 85.26 | 63.07 | 63.01 | 79.64 | 61.11 |
|
|
|
|