Experimental Ties-Merge between 5 Models and 2 LORAs at varying weights and densities.
And trained with some dataset.
This is purely for my personal testing. Use if you want.
Open LLM Leaderboard Evaluation Results
Detailed results can be found here
Metric | Value |
---|---|
Avg. | 49.62 |
ARC (25-shot) | 56.48 |
HellaSwag (10-shot) | 78.57 |
MMLU (5-shot) | 51.56 |
TruthfulQA (0-shot) | 47.7 |
Winogrande (5-shot) | 75.06 |
GSM8K (5-shot) | 1.44 |
DROP (3-shot) | 36.53 |
- Downloads last month
- 4,096