Open LLM Leaderboard
Track, rank and evaluate open LLMs and chatbots
Select and filter benchmarks for text embedding tasks
Ranking of LLMs for agentic tasks
Track, rank and evaluate open LLMs and chatbots in French
Korean Leaderboard
Display and filter a leaderboard of UGI models
Submit code models for evaluation on benchmarks
Submit and evaluate models on a leaderboard
Display and filter an open-r1 model leaderboard
VLMEvalKit Evaluation Results Collection
Display energy efficiency scores for AI models
The only leaderboard you will require for your RAG needs
Generate images from text descriptions
Upload and evaluate video models
Browse and submit LLM evaluations
Explore and filter language model benchmark results
Explore and submit LLM benchmark evaluations
Display OCRBench leaderboard for model evaluations
Track, rank and evaluate open LLMs in Portuguese
Explore and analyze code evaluation data
Leaderboard and arena of Video Generation models
Explore model leaderboards for various NLP tasks
Open Persian LLM Leaderboard