Open LLM Leaderboard
Track, rank and evaluate open LLMs and chatbots
Track, rank and evaluate open LLMs and chatbots
Embedding Leaderboard
Display LMArena Leaderboard
Explore and compare code generation models on a leaderboard
Uncensored General Intelligence Leaderboard
View and request speech models benchmark data
VLMEvalKit Evaluation Results Collection
Explore hardware performance for LLMs
Explore and filter language model benchmark results
Submit and evaluate models on GAIA leaderboard
Image Generation and Image Editing Arena & Leaderboard
Ranking of LLMs for agentic tasks
Explore and submit models for benchmarking
View LLM performance rankings
Display and search reinforcement learning leaderboard data
Upload and evaluate video models
Track, rank and evaluate open LLMs in Portuguese
Explore and analyze code completion benchmarks
Display OCRBench leaderboard and results
Display a web app using Streamlit
Track, rank and evaluate open LLMs and chatbots
Browse and compare visual document retrieval models
View and filter LLM hallucination leaderboard
Text to Video and Image to Video Arena & Leaderboard