Running on CPU Upgrade 11.8k 🏆 Open LLM Leaderboard 2 Track, rank and evaluate open LLMs and chatbots
CompassJudger-1: All-in-one Judge Model Helps Model Evaluation and Evolution Paper • 2410.16256 • Published Oct 21 • 58