anhnv125
/

pygmalion-6b-roleplay

Text Generation

Inference Endpoints

Model card Files Files and versions Community

pygmalion-6b-roleplay / README.md

leaderboard-pr-bot's picture

leaderboard-pr-bot

Adding Evaluation Results

a720fa6 about 1 year ago

|

662 Bytes

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric	Value
Avg.	33.66
ARC (25-shot)	40.53
HellaSwag (10-shot)	67.47
MMLU (5-shot)	25.73
TruthfulQA (0-shot)	32.53
Winogrande (5-shot)	62.67
GSM8K (5-shot)	1.14
DROP (3-shot)	5.56