Nous-Hermes-2-SUS-Chat-34B-Linear
Nous-Hermes-2-SUS-Chat-34B-Linear is a linear merge of Nous-Hermes-2-Yi-34B and SUS-Chat-34B, created with mergekit.
YAML Config
models:
  - model: Nous-Hermes-2-Yi-34B
    parameters:
      weight: 0.5
  - model: SUS-Chat-34B
    parameters:
      weight: 0.3
merge_method: linear
dtype: bfloat16
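To illustrate what a linear merge with these weights computes, here is a minimal sketch in plain Python. It is not mergekit's implementation: `linear_merge` is a hypothetical helper, the tensors are toy lists, and the `normalize=True` default reflects an assumption that the weights are rescaled to sum to 1 before averaging (so 0.5 and 0.3 would act as 0.625 and 0.375).

```python
from typing import List, Sequence


def linear_merge(
    tensors: Sequence[Sequence[float]],
    weights: Sequence[float],
    normalize: bool = True,
) -> List[float]:
    """Element-wise weighted average of equally shaped, flattened tensors.

    Hypothetical helper for illustration only; not part of mergekit.
    """
    if len(tensors) != len(weights):
        raise ValueError("need exactly one weight per tensor")
    if not tensors:
        raise ValueError("need at least one tensor")

    length = len(tensors[0])
    if any(len(t) != length for t in tensors):
        raise ValueError("all tensors must have the same length")

    scaled = list(weights)
    if normalize:
        # Assumption: weights are rescaled so that they sum to 1.
        total = sum(scaled)
        if total == 0:
            raise ValueError("weights must not sum to zero")
        scaled = [w / total for w in scaled]

    # Weighted sum of the i-th element across all input tensors.
    return [
        sum(w * t[i] for w, t in zip(scaled, tensors))
        for i in range(length)
    ]


# Toy stand-ins for one parameter tensor from each source model.
hermes_tensor = [1.0, 2.0]
sus_tensor = [3.0, 4.0]
merged = linear_merge([hermes_tensor, sus_tensor], [0.5, 0.3])
```

Under that normalization assumption, the first model contributes 62.5% and the second 37.5% of every merged parameter.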
Open LLM Leaderboard Evaluation Results
Detailed results can be found here
| Metric | Value |
|---|---|
| Avg. | 73.69 |
| AI2 Reasoning Challenge (25-Shot) | 66.38 |
| HellaSwag (10-Shot) | 84.94 |
| MMLU (5-Shot) | 76.82 |
| TruthfulQA (0-shot) | 59.19 |
| Winogrande (5-shot) | 82.79 |
| GSM8k (5-shot) | 72.02 |
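The "Avg." row appears to be the unweighted mean of the six benchmark scores. The short check below, using the values from the table, is a sketch under that assumption; the variable names are illustrative.

```python
# Benchmark scores copied from the table above.
scores = {
    "AI2 Reasoning Challenge (25-Shot)": 66.38,
    "HellaSwag (10-Shot)": 84.94,
    "MMLU (5-Shot)": 76.82,
    "TruthfulQA (0-shot)": 59.19,
    "Winogrande (5-shot)": 82.79,
    "GSM8k (5-shot)": 72.02,
}

# Unweighted mean, rounded to two decimals like the table.
average = round(sum(scores.values()) / len(scores), 2)
print(average)
```

The result matches the reported 73.69.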
Evaluation results
- AI2 Reasoning Challenge (25-Shot), test set: normalized accuracy 66.38 (Open LLM Leaderboard)
- HellaSwag (10-Shot), validation set: normalized accuracy 84.94 (Open LLM Leaderboard)
- MMLU (5-Shot), test set: accuracy 76.82 (Open LLM Leaderboard)
- TruthfulQA (0-shot), validation set: mc2 59.19 (Open LLM Leaderboard)
- Winogrande (5-shot), validation set: accuracy 82.79 (Open LLM Leaderboard)
- GSM8k (5-shot), test set: accuracy 72.02 (Open LLM Leaderboard)