datasets:
  - garage-bAInd/Open-Platypus
language:
  - en
license: apache-2.0

Samantha-Nebula-7B

Samantha-Nebula-7B is a merge of ehartford/samantha-mistral-7b and PulsarAI/Nebula-7B.
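
This card does not state which merge method was used. Purely as an illustration of what merging two checkpoints can mean, here is a minimal sketch of linear weight interpolation over toy parameter dictionaries. The function name `linear_merge`, the `alpha` parameter, and the toy tensors are hypothetical and are not taken from this repository.

```python
# Illustrative only: linear interpolation of two sets of weights.
# The actual merge method for Samantha-Nebula-7B is not documented here.

def linear_merge(weights_a, weights_b, alpha=0.5):
    """Return alpha * weights_a + (1 - alpha) * weights_b, parameter by parameter.

    Both arguments map parameter names to flat lists of floats
    (a stand-in for real tensors).
    """
    if weights_a.keys() != weights_b.keys():
        raise ValueError("both checkpoints must have the same parameter names")

    merged = {}
    for name in weights_a:
        a, b = weights_a[name], weights_b[name]
        if len(a) != len(b):
            raise ValueError(f"shape mismatch for parameter {name!r}")
        merged[name] = [alpha * x + (1 - alpha) * y for x, y in zip(a, b)]
    return merged


# Toy stand-ins for the two parent models' weights (hypothetical values).
toy_a = {"layer.weight": [1.0, 2.0]}
toy_b = {"layer.weight": [3.0, 4.0]}

merged = linear_merge(toy_a, toy_b, alpha=0.5)
print(merged)
```

With `alpha=0.5` each merged value is the plain average of the two parents; other values of `alpha` weight one parent more heavily.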

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

| Metric | Value |
|---|---|
| Avg. | 52.87 |
| ARC (25-shot) | 57.0 |
| HellaSwag (10-shot) | 82.25 |
| MMLU (5-shot) | 54.21 |
| TruthfulQA (0-shot) | 49.58 |
| Winogrande (5-shot) | 73.09 |
| GSM8K (5-shot) | 11.37 |
| DROP (3-shot) | 42.57 |
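
The reported average appears to be the unweighted mean of the seven benchmark scores listed above. A short sketch that checks this, assuming equal weighting (the variable names are illustrative):

```python
# Benchmark scores copied from the results listed above.
scores = {
    "ARC (25-shot)": 57.0,
    "HellaSwag (10-shot)": 82.25,
    "MMLU (5-shot)": 54.21,
    "TruthfulQA (0-shot)": 49.58,
    "Winogrande (5-shot)": 73.09,
    "GSM8K (5-shot)": 11.37,
    "DROP (3-shot)": 42.57,
}

# Unweighted mean across all benchmarks (assumed to be how "Avg." is computed).
average = sum(scores.values()) / len(scores)
print(f"{average:.2f}")  # prints 52.87
```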