Gathering benchmark spaces on the hub (beyond the Open LLM Leaderboard)
Open LLM Leaderboard
community
AI & ML interests
Evaluating open LLMs
Organization Card
Open LLM Leaderboard
This is the hub organisation maintaining the Open LLM Leaderboard.
In this organization you will find the datasets holding the detailed results and queries for the models on the leaderboard.
Score results are in open-llm-leaderboard/results, and the current state of requests is in open-llm-leaderboard/requests. For detailed predictions, look for your model name among the datasets below, which are named details_<org>__<model>.
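To make "look for your model name" concrete, here is a minimal sketch that turns a model ID into the matching details dataset name. The naming pattern is inferred only from the dataset names listed on this page (for example details_mistralai__Mistral-7B-v0.3), and the helper name details_repo_id is hypothetical, not part of any library.

```python
# Assumption: details datasets follow the pattern
#   open-llm-leaderboard/details_<org>__<model>
# as inferred from the dataset names listed on this page.
LEADERBOARD_ORG = "open-llm-leaderboard"


def details_repo_id(model_id: str) -> str:
    """Return the details dataset name for a model ID of the form '<org>/<model>'.

    Hypothetical helper for illustration only.
    """
    org, sep, name = model_id.partition("/")
    # Require exactly one '/' with non-empty parts on both sides.
    if not sep or not org or not name or "/" in name:
        raise ValueError(f"expected '<org>/<model>', got {model_id!r}")
    return f"{LEADERBOARD_ORG}/details_{org}__{name}"


if __name__ == "__main__":
    print(details_repo_id("mistralai/Mistral-7B-v0.3"))
    # prints: open-llm-leaderboard/details_mistralai__Mistral-7B-v0.3
```

The resulting name can then be searched for in the dataset list. Whether a details dataset exists for a given model depends on whether that model has been evaluated.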
Collections
3
A daily updated list of the models with the best evaluations on the Open LLM Leaderboard:
- chargoddard/Yi-34B-Llama (Text Generation • 3.46k • 56)
- yunconglong/Truthful_DPO_TomGrc_FusionNet_7Bx2_MoE_13B (Text Generation • 4.24k • 52)
- fblgit/UNA-SimpleSmaug-34b-v1beta (Text Generation • 2.53k • 20)
- cloudyu/TomGrc_FusionNet_34Bx2_MoE_v0.1_DPO_f16 (Text Generation • 2.72k • 15)
models
None public yet
datasets
6725
open-llm-leaderboard/dynamic_model_information • 1 • 5
open-llm-leaderboard/requests • 11 • 21
open-llm-leaderboard/details_mistralai__Mistral-7B-v0.3
open-llm-leaderboard/results • 417 • 48
open-llm-leaderboard/details_kimdeokgi__merge_model_test2
open-llm-leaderboard/details_adamo1139__Yi-34B-200K-HESOYAM-0905
open-llm-leaderboard/details_xxx777xxxASD__ChaoticSoliloquy-4x8B
open-llm-leaderboard/details_DAMO-NLP-SG__CLEX-Mixtral-8x7B-Chat-32K
open-llm-leaderboard/details_TwT-6__cr-model
open-llm-leaderboard/details_LeroyDyer__Mixtral_AI_LCARS_