breadlicker45
/

dough-instruct-base-001

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

dough-instruct-base-001 / README.md

leaderboard-pr-bot's picture

leaderboard-pr-bot

Adding Evaluation Results

8e26681 11 months ago

|

709 Bytes

metadata

datasets:
  - breadlicker45/bread-qa

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric	Value
Avg.	25.22
ARC (25-shot)	23.89
HellaSwag (10-shot)	24.76
MMLU (5-shot)	23.13
TruthfulQA (0-shot)	53.4
Winogrande (5-shot)	51.07
GSM8K (5-shot)	0.0
DROP (3-shot)	0.29