LlaMA-2 License, more details coming soon...

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Detailed results can be found here

Spaces using pankajmathur/model_420_preview 12

normalized accuracy on AI2 Reasoning Challenge (25-Shot)
test set Open LLM Leaderboard

67.060
normalized accuracy on HellaSwag (10-Shot)
validation set Open LLM Leaderboard

87.260
accuracy on MMLU (5-Shot)
test set Open LLM Leaderboard

69.850
mc2 on TruthfulQA (0-shot)
validation set Open LLM Leaderboard

44.570
accuracy on Winogrande (5-shot)
validation set Open LLM Leaderboard

83.350
accuracy on GSM8k (5-shot)
test set Open LLM Leaderboard

33.210