Weyaxi's picture
Adding Evaluation Results (#2)
5e083d8
|
raw
history blame
No virus
1.73 kB
metadata
license: llama2
datasets:
  - garage-bAInd/Open-Platypus
  - Open-Orca/OpenOrca
language:
  - en

image/png

Buy Me A Coffee

OpenOrca-Platypus2-13B-QLoRA-0.80-epoch

Luban-Platypus2-13B-QLora-0.80-epoch is a merge of ai-business/Luban-13B and Platypus2-13B-QLoRA-0.80-epoch

Evaluation Results (Open LLM Leaderboard)

Metric Value
Avg. 63,94
ARC (25-shot) 60,24
HellaSwag (10-shot) 82,22
MMLU (5-shot) 58,03
TruthfulQA (0-shot) 55,26

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric Value
Avg. 49.28
ARC (25-shot) 60.24
HellaSwag (10-shot) 82.22
MMLU (5-shot) 58.03
TruthfulQA (0-shot) 55.26
Winogrande (5-shot) 75.37
GSM8K (5-shot) 0.91
DROP (3-shot) 12.95