Spaces

·

The AI App Directory

New Space What is Spaces?

Restarting on CPU Upgrade

Hebrew LLM Leaderboard

Browse and evaluate language models

Indic Llm Leaderboard

Browse and compare Indic language LLMs on a leaderboard

Configuration error

Leaderboard

Multilingual LMSys Chatbot Arena Leaderboard

Multilingual metrics for the LMSys Arena Leaderboard

Science Leaderboard

Leaderboard for LLM for Science Reasoning

Running on CPU Upgrade

La Leaderboard

Evaluate open LLMs in the languages of LATAM and Spain.

Occiglot Euro LLM Leaderboard

Search and submit LLM evaluations

Running on CPU Upgrade

Open Arabic LLM Leaderboard

Track, rank and evaluate open Arabic LLMs and chatbots

Low-bit Quantized Open LLM Leaderboard

Track, rank and evaluate open LLMs and chatbots

Swahili Llm Leaderboard

Display and filter LLM leaderboard data

AIR-Bench Leaderboard

Explore benchmark results for QA and long doc models

LongICL Leaderboard

Leaderboard for long LLM on In-context Learning

Open CoT Dashboard

Visualize model performance with interactive plots and tables

Effibench Leaderboard

Display chatbot performance leaderboard

Open Tw Llm Leaderboard

Browse and submit LLM evaluations

IL-TUR Leaderboard

Multimodal Hallucination Leaderboard

IFEval Leaderboard

Display IFEval leaderboard for language models

EgoPlan-Bench Leaderboard

Post-ASR LLM based Speaker Tagging Leaderboard

Submit evaluations for speaker tagging and view leaderboard

LLM Leaderboard for CRM

Filter and view AI model leaderboard data

Clinical NER Leaderboard

Explore and submit NER models

Icelandic LLM leaderboard

Explore and filter LLM benchmark data

Open-LLM performances are plateauing, let’s make the leaderboard steep again

Update leaderboard for fair model evaluation