giraffe176 committed on
Commit
4e4c381
1 Parent(s): ff59f8b

Update README.md

  1. README.md +7 -9
README.md CHANGED
@@ -198,12 +198,10 @@ Scores 65.56 on [EQ-Bench v2](https://arxiv.org/abs/2312.06281)
  ### [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
  Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_giraffe176__WestLake_Noromaid_OpenHermes_neural-chatv0.1)
 
- | Metric |Value|
- |---------------------------------|----:|
- |Avg. |68.86|
- |AI2 Reasoning Challenge (25-Shot)|66.72|
- |HellaSwag (10-Shot) |85.37|
- |MMLU (5-Shot) |64.67|
- |TruthfulQA (0-shot) |51.50|
- |Winogrande (5-shot) |79.72|
- |GSM8k (5-shot) |65.20|
+ | | Avg. | AI2 (25-Shot) | HellaSwag (10-Shot) | MMLU (5-Shot) | TruthfulQA (0-shot) | Winogrande (5-shot) | GSM8k (5-shot) |
+ |:-----------------------------------------:|-------|-----------------------------------|---------------------|---------------|---------------------|---------------------|----------------|
+ | This model | 68.86 | 66.72 | 85.37 | 64.67 | 51.50 | 79.72 | 65.20 |
+ | cognitivecomputations/WestLake-7B-v2-laser| **74.78** | 73.29 | **88.66** | **64.72** | **67.04** | **86.74** | **68.23** |
+ | NeverSleep/Noromaid-7B-0.4-DPO | 59.08 | 62.29 | 84.32 | 63.2 | 42.28 | 76.95 | 25.47 |
+ | teknium/OpenHermes-2.5-Mistral-7B | 61.52 | 64.93 | 84.18 | 63.64 | 52.24 | 78.06 | 26.08 |
+ | Intel/neural-chat-7b-v3-3 | 69.83 | **66.89** | 85.26 | 63.07 | 63.01 | 79.64 | 61.11 |
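
The Avg. column in the added table appears to be the unweighted mean of the six benchmark scores in each row. The sketch below recomputes it as a sanity check. The scores are copied from the table; the names `rows`, `mean_score`, and `matches_reported`, and the 0.01 tolerance, are hypothetical choices made for this illustration and are not part of the commit.

```python
# Scores copied from the table above, in column order:
# AI2, HellaSwag, MMLU, TruthfulQA, Winogrande, GSM8k.
# Each entry is (reported Avg., [six benchmark scores]).
rows = {
    "This model": (68.86, [66.72, 85.37, 64.67, 51.50, 79.72, 65.20]),
    "cognitivecomputations/WestLake-7B-v2-laser": (74.78, [73.29, 88.66, 64.72, 67.04, 86.74, 68.23]),
    "NeverSleep/Noromaid-7B-0.4-DPO": (59.08, [62.29, 84.32, 63.2, 42.28, 76.95, 25.47]),
    "teknium/OpenHermes-2.5-Mistral-7B": (61.52, [64.93, 84.18, 63.64, 52.24, 78.06, 26.08]),
    "Intel/neural-chat-7b-v3-3": (69.83, [66.89, 85.26, 63.07, 63.01, 79.64, 61.11]),
}


def mean_score(scores):
    """Return the unweighted arithmetic mean of a list of scores."""
    return sum(scores) / len(scores)


def matches_reported(reported, scores, tol=0.01):
    """Check that the recomputed mean is within `tol` of the reported average.

    The tolerance is an assumption: it allows for the table values having
    been rounded or truncated to two decimals.
    """
    return abs(mean_score(scores) - reported) < tol


for name, (reported, scores) in rows.items():
    status = "ok" if matches_reported(reported, scores) else "mismatch"
    print(f"{name}: reported {reported:.2f}, recomputed {mean_score(scores):.4f} ({status})")
```

A tolerance is used instead of an exact comparison because the published figures are already rounded, so a recomputed mean can differ from the reported one in the third decimal place.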