Nova-13B-50-step / README.md
license: llama2

# Open LLM Leaderboard Evaluation Results

Detailed results can be found here

| Metric              | Value |
|---------------------|-------|
| Avg.                | 48.93 |
| ARC (25-shot)       | 61.6  |
| HellaSwag (10-shot) | 82.31 |
| MMLU (5-shot)       | 57.27 |
| TruthfulQA (0-shot) | 51.53 |
| Winogrande (5-shot) | 76.56 |
| GSM8K (5-shot)      | 4.4   |
| DROP (3-shot)       | 8.82  |
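
The Avg. row appears to be the unweighted mean of the seven benchmark scores listed above. That is an assumption, but it is consistent with the numbers. A minimal Python sketch to reproduce it, with the scores copied by hand from the table and an illustrative variable layout that is not part of any leaderboard tooling:

```python
# Scores copied by hand from the results table above.
scores = {
    "ARC (25-shot)": 61.6,
    "HellaSwag (10-shot)": 82.31,
    "MMLU (5-shot)": 57.27,
    "TruthfulQA (0-shot)": 51.53,
    "Winogrande (5-shot)": 76.56,
    "GSM8K (5-shot)": 4.4,
    "DROP (3-shot)": 8.82,
}

# Assumption: "Avg." is the plain arithmetic mean over all seven metrics.
average = sum(scores.values()) / len(scores)

print(f"Avg. {average:.2f}")  # prints "Avg. 48.93"
```

The low GSM8K and DROP scores pull the mean well below the other five benchmarks, so the average alone understates performance on ARC, HellaSwag, MMLU, TruthfulQA, and Winogrande.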