Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
open-llm-leaderboard
/
blog
like
102
Running
App
Files
Files
Community
2
c532f52
blog
/
assets
/
images
3 contributors
History:
2 commits
Clémentine
fixed table of contents and figures
fe3bc07
5 months ago
ifeval_score_per_model_type.png
Safe
42.5 kB
init
5 months ago
math_fn_gsm8k.png
Safe
25.2 kB
init
5 months ago
math_score_per_model_type.png
Safe
39.9 kB
init
5 months ago
normalized_vs_raw_scores.png
Safe
71.1 kB
init
5 months ago
ranking_top10_bottom10.png
Safe
104 kB
init
5 months ago
saturation.png
Safe
66 kB
fixed table of contents and figures
5 months ago
task_vs_mean.png
Safe
211 kB
init
5 months ago
timewise_analysis_full.png
Safe
133 kB
init
5 months ago
timewise_analysis_light.png
Safe
79.7 kB
init
5 months ago
v2_correlation_heatmap.png
Safe
67.5 kB
init
5 months ago
v2_fn_of_mmlu.png
Safe
40.1 kB
init
5 months ago