Running on CPU Upgrade 73 AIR-Bench Leaderboard π₯ 73 Explore and compare QA and long doc benchmarks
Running on CPU Upgrade 13.7k Open LLM Leaderboard π 13.7k Track, rank and evaluate open LLMs and chatbots