NPHardEval Leaderboard: Unveiling the Reasoning Abilities of Large Language Models through Complexity Classes and Dynamic Updates (Feb 2)
Open LLM Leaderboard: Track, rank and evaluate open LLMs and chatbots