AIR-Bench Leaderboard: Explore benchmark results for QA and long doc models
Open LLM Leaderboard: Track, rank and evaluate open LLMs and chatbots