Hebrew LLM Leaderboard
Browse and evaluate language models
Browse and compare Indic language LLMs on a leaderboard
Multilingual metrics for the LMSys Arena Leaderboard
Leaderboard for LLMs on science reasoning
Evaluate open LLMs in the languages of LATAM and Spain
Search and submit LLM evaluations
Track, rank and evaluate open Arabic LLMs and chatbots
Track, rank and evaluate open LLMs and chatbots
Display and filter LLM leaderboard data
Explore benchmark results for QA and long-document models
Leaderboard for long-context LLMs on in-context learning
Visualize model performance with interactive plots and tables
Display chatbot performance leaderboard
Browse and submit LLM evaluations
Display IFEval leaderboard for language models
Submit evaluations for speaker tagging and view leaderboard
Filter and view AI model leaderboard data
Explore and submit NER models
Explore and filter LLM benchmark data
Update leaderboard for fair model evaluation