-
12.1kπ
Open LLM Leaderboard
Track, rank and evaluate open LLMs and chatbots
-
106ποΈ
Open-LLM performances are plateauing, letβs make the leaderboard steep again
-
open-llm-leaderboard/contents
Viewer β’ Updated β’ 2.43k β’ 13.7k β’ 7 -
open-llm-leaderboard/results
Preview β’ Updated β’ 33.9k β’ 7
Open LLM Leaderboard
community
AI & ML interests
Evaluating open LLMs
Recent Activity
View all activity
Organization Card
Open LLM Leaderboard
This is the hub organisation maintaining the Open LLM Leaderboard.
In this space you will find the dataset with detailed results and queries for the models on the leaderboard.
Score results are here, and current state of requests is here. For the detailed prediction, look for your model name in the datasets below!
Collections
2
Gathering benchmark spaces on the hub (beyond the Open LLM Leaderboard)
spaces
5
pinned
Running
on
CPU Upgrade
12.1k
π
Open LLM Leaderboard
Track, rank and evaluate open LLMs and chatbots
Running
75
π
Open LLM Leaderboard Model Comparator
Compare Open LLM Leaderboard results
Running
106
ποΈ
Open-LLM performances are plateauing, letβs make the leaderboard steep again
Running
6
π
Exploring model generations
Runtime error
1
π
Sample Viewer
models
None public yet
datasets
2407
open-llm-leaderboard/requests
Preview
β’
Updated
β’
100k
β’
9
open-llm-leaderboard/contents
Viewer
β’
Updated
β’
2.43k
β’
13.7k
β’
7
open-llm-leaderboard/sometimesanotion__Qwentinuum-14B-v013-details
Updated
open-llm-leaderboard/results
Preview
β’
Updated
β’
33.9k
β’
7
open-llm-leaderboard/JayHyeon__Qwen2.5-0.5B-Instruct-SFT-IRPO-1epoch_v1-details
Updated
open-llm-leaderboard/JayHyeon__Qwen2.5-0.5B-Instruct-SFT-DPO-1epoch_v1-details
Updated
open-llm-leaderboard/JayHyeon__Qwen2.5-0.5B-Instruct-SFT-MDPO-1epoch_v1-details
Updated
open-llm-leaderboard/mergekit-community__mergekit-slerp-fmrazcr-details
Updated
open-llm-leaderboard/JayHyeon__Qwen2.5-0.5B-Instruct-SFT-details
Updated
open-llm-leaderboard/Sicarius-Prototyping__Micropenis_1B-details
Updated