leaderboard-pr-bot's picture
Adding Evaluation Results
3680bc8
|
raw
history blame
1.73 kB
metadata
license: llama2
datasets:
  - garage-bAInd/Open-Platypus
  - GAIR/lima
language:
  - en

image/png

Buy Me A Coffee

Limarp-Platypus2-13B-QLoRA-0.80-epoch

Limarp-Platypus2-13B-QLoRA-0.80-epoch is a merge of Oniichat/limarp-13b-merged and Platypus2-13B-QLoRA-0.80-epoch

Evaluation Results (Open LLM Leaderboard)

Metric Value
Avg. 60.98
ARC (25-shot) 60.49
HellaSwag (10-shot) 82.76
MMLU (5-shot) 56.52
TruthfulQA (0-shot) 44.14

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric Value
Avg. 47.74
ARC (25-shot) 60.49
HellaSwag (10-shot) 82.76
MMLU (5-shot) 56.52
TruthfulQA (0-shot) 44.14
Winogrande (5-shot) 76.8
GSM8K (5-shot) 6.07
DROP (3-shot) 7.4