Weyaxi's picture
Adding Evaluation Results (#1)
ac325d0
|
raw
history blame
1.82 kB
metadata
license: llama2
datasets:
  - garage-bAInd/Open-Platypus
  - Open-Orca/OpenOrca
  - totally-not-an-llm/EverythingLM-data-V2
language:
  - en

image/png

Buy Me A Coffee

PuddleJumper-Platypus2-13B-QLoRA-0.80-epoch

PuddleJumper-Platypus2-13B-QLoRA-0.80-epoch is a merge of totally-not-an-llm/PuddleJumper-13b and Platypus2-13B-QLoRA-0.80-epoch

Evaluation Results (Open LLM Leaderboard)

Metric Value
Avg. 60.84
ARC (25-shot) 54.52
HellaSwag (10-shot) 79.36
MMLU (5-shot) 55.15
TruthfulQA (0-shot) 54.32

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric Value
Avg. 47.85
ARC (25-shot) 54.52
HellaSwag (10-shot) 79.36
MMLU (5-shot) 55.15
TruthfulQA (0-shot) 54.32
Winogrande (5-shot) 71.11
GSM8K (5-shot) 0.0
DROP (3-shot) 20.49