mamba-gpt-3b-v2 is the Best 3B Model! Surpassing dolly-v2-12b

#137
by CobraMamba - opened

The best 3B model on the Open LLM Leaderboard, with performance surpassing dolly-v2-12b

Metric Value
MMLU (5-shot) 27.1
ARC (25-shot) 42.2
HellaSwag (10-shot) 71.5
TruthfulQA (0-shot) 36.7
Avg. 44.4

We use state-of-the-art Language Model Evaluation Harness to run the benchmark tests above.

clefourrier changed discussion status to closed

Sign up or log in to comment