Weyaxi
/

Chat-AYB-Nova-13B

Text Generation

Inference Endpoints

text-generation-inference

Model card Files Files and versions Community

Chat-AYB-Nova-13B / README.md

leaderboard-pr-bot's picture

leaderboard-pr-bot

Adding Evaluation Results

fd26a08 7 months ago

|

raw history blame

No virus

659 Bytes

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric	Value
Avg.	50.73
ARC (25-shot)	62.97
HellaSwag (10-shot)	84.28
MMLU (5-shot)	58.58
TruthfulQA (0-shot)	51.28
Winogrande (5-shot)	77.58
GSM8K (5-shot)	12.36
DROP (3-shot)	8.03