license: cc-by-nc-4.0

Prompt: "Below is an instruction that describes a task. Write a response that appropriately completes the request. \n\n### Instruction:\n INSTRUCTION. \n### Response:\n"
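A minimal sketch of how the prompt above could be filled in programmatically. The names `PROMPT_TEMPLATE` and `build_prompt` are hypothetical and not part of the model's repository. It assumes that `INSTRUCTION` is the only placeholder and that the surrounding whitespace and the trailing period belong to the template exactly as written above.

```python
# Template copied verbatim from the model card, including its spacing.
PROMPT_TEMPLATE = (
    "Below is an instruction that describes a task. "
    "Write a response that appropriately completes the request. \n\n"
    "### Instruction:\n INSTRUCTION. \n### Response:\n"
)


def build_prompt(instruction: str) -> str:
    """Substitute the user's instruction for the INSTRUCTION placeholder.

    Assumption: the placeholder appears exactly once, so only the first
    (case-sensitive) match is replaced.
    """
    return PROMPT_TEMPLATE.replace("INSTRUCTION", instruction.strip(), 1)


if __name__ == "__main__":
    print(build_prompt("Name three primary colors"))
```

The resulting string would be passed to the tokenizer as-is; the model's completion is expected to follow the final `### Response:` line.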

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

| Metric | Value |
|---|---|
| Avg. | 27.41 |
| ARC (25-shot) | 27.39 |
| HellaSwag (10-shot) | 38.46 |
| MMLU (5-shot) | 25.67 |
| TruthfulQA (0-shot) | 42.76 |
| Winogrande (5-shot) | 53.51 |
| GSM8K (5-shot) | 0.15 |
| DROP (3-shot) | 3.91 |
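The reported average appears to be the unweighted mean of the seven benchmark scores. A short check, assuming equal weighting (the variable names are illustrative):

```python
# Benchmark scores as listed in the results above.
scores = {
    "ARC (25-shot)": 27.39,
    "HellaSwag (10-shot)": 38.46,
    "MMLU (5-shot)": 25.67,
    "TruthfulQA (0-shot)": 42.76,
    "Winogrande (5-shot)": 53.51,
    "GSM8K (5-shot)": 0.15,
    "DROP (3-shot)": 3.91,
}

# Unweighted mean over all seven benchmarks.
average = sum(scores.values()) / len(scores)
print(round(average, 2))  # 27.41
```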