TinyPixel/elm-test



Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric	Value
Avg.	43.74
ARC (25-shot)	53.16
HellaSwag (10-shot)	78.98
MMLU (5-shot)	47.04
TruthfulQA (0-shot)	39.51
Winogrande (5-shot)	74.35
GSM8K (5-shot)	7.51
DROP (3-shot)	5.65
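
The Avg. row appears to be the unweighted mean of the seven benchmark scores listed above. The following is a minimal sketch that checks this, assuming equal weighting; the variable names are illustrative and not part of any leaderboard tooling.

```python
# Scores copied from the table above (benchmark name -> value).
scores = {
    "ARC (25-shot)": 53.16,
    "HellaSwag (10-shot)": 78.98,
    "MMLU (5-shot)": 47.04,
    "TruthfulQA (0-shot)": 39.51,
    "Winogrande (5-shot)": 74.35,
    "GSM8K (5-shot)": 7.51,
    "DROP (3-shot)": 5.65,
}

# Assumption: the average is a plain arithmetic mean with equal weights.
average = sum(scores.values()) / len(scores)

print(f"Avg. {average:.2f}")
```

Under that assumption, the result rounded to two decimals matches the reported Avg. value of 43.74.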