256_5epoch / README.md
Corianas's picture
Adding Evaluation Results (#2)
2a2bd0f
metadata
license: cc-by-nc-sa-4.0

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric Value
Avg. 25.35
ARC (25-shot) 22.27
HellaSwag (10-shot) 28.99
MMLU (5-shot) 26.62
TruthfulQA (0-shot) 41.71
Winogrande (5-shot) 52.72
GSM8K (5-shot) 0.23
DROP (3-shot) 4.93