Benchmarks Running 26 ๐ญ Stick To Your Role! Leaderboard Configuration error 41 ๐ ZeroEval Leaderboard Running on CPU Upgrade 144 ๐ฅ MMLU Pro More advanced and challenging multi-task evaluation Running on CPU Upgrade 11.6k ๐ Open LLM Leaderboard 2 Track, rank and evaluate open LLMs and chatbots
Running on CPU Upgrade 11.6k ๐ Open LLM Leaderboard 2 Track, rank and evaluate open LLMs and chatbots