NPHardEval Leaderboard: Unveiling the Reasoning Abilities of Large Language Models through Complexity Classes and Dynamic Updates Feb 2 • 2
Running on CPU Upgrade 11.7k 🏆 Open LLM Leaderboard 2 Track, rank and evaluate open LLMs and chatbots