Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
Duplicated from
demo-leaderboard-backend/leaderboard
evalitahf
/
evalita_llm_leaderboard
like
10
Running
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
main
evalita_llm_leaderboard
360 kB
3 contributors
History:
123 commits
rzanoli
Fix: format best model scores to one decimal place
178e582
3 days ago
src
Fix prompt text and answer choices inconsistencies
about 1 month ago
src_maia
Revise the measurement description for MAIA
3 days ago
.gitattributes
Safe
1.53 kB
Duplicate from demo-leaderboard-backend/leaderboard
9 months ago
.gitignore
Safe
136 Bytes
Duplicate from demo-leaderboard-backend/leaderboard
9 months ago
.pre-commit-config.yaml
Safe
1.53 kB
Duplicate from demo-leaderboard-backend/leaderboard
9 months ago
Makefile
Safe
208 Bytes
Duplicate from demo-leaderboard-backend/leaderboard
9 months ago
README.md
Safe
1.48 kB
Small changes
9 months ago
app.py
Safe
61.6 kB
Fix: format best model scores to one decimal place
3 days ago
app_18_09_2025.py
Safe
33.7 kB
Refactor and optimize all interface chart code
3 months ago
app_22_09_2025.py
Safe
25.3 kB
Add performance metrics labels with average, std dev, and best model info.
3 months ago
app_30_09_2025.py
Safe
32.2 kB
Add heatmap and model comparison table
2 months ago
example_app.py
Safe
13.9 kB
Small changes
9 months ago
example_app2.py
Safe
9.8 kB
Small changes
9 months ago
get_model_info.py
Safe
5.39 kB
Small changes
9 months ago
preprocess_models_output.py
Safe
8.88 kB
Small changes to preporcess vision model files
2 months ago
preprocess_models_output_old.py
Safe
7.03 kB
Small changes
9 months ago
pyproject.toml
Safe
548 Bytes
Duplicate from demo-leaderboard-backend/leaderboard
9 months ago
requirements.txt
Safe
211 Bytes
Add the plotly library for creating charts
4 months ago
run_instructions.txt
Safe
2.92 kB
Updated documentation description for the pipeline to produce leaderboard data.
4 days ago