Running on CPU Upgrade 13k 13k Open LLM Leaderboard 🏆 Track, rank and evaluate open LLMs and chatbots
Running 553 553 Scaling test-time compute 📈 Enhance math problem solving by scaling test-time compute