Spaces: CoreyMorris/MMLU-by-task-Leaderboard
4 contributors
History: 131 commits
Latest commit: 7681250, "updated dev requirements" (Corey Morris, 12 months ago)
| File | Size | Last commit message | Last updated |
|---|---|---|---|
| .gitattributes | 1.52 kB | initial commit | about 1 year ago |
| .gitignore | 63 Bytes | updated gitignore | 12 months ago |
| .gitmodules | 106 Bytes | added hugging face evaluation harness results submodule | about 1 year ago |
| README.md | 248 Bytes | initial commit | about 1 year ago |
| app.py | 15.7 kB | Updated model count | 12 months ago |
| contaminated_models.csv | 117 Bytes | Updated contaminated models | 12 months ago |
| contaminated_models.txt | 65 Bytes | Updated contaminated models | 12 months ago |
| details_data_processor.py | 4.04 kB | updated pipeline and init | 12 months ago |
| dev_requirements.txt | 130 Bytes | updated dev requirements | 12 months ago |
| requirements.txt | 199 Bytes | updated requirements.txt | 12 months ago |
| result_data_processor.py | 5.94 kB | removing models that are known to have training data contaminated with evaluations | 12 months ago |
| save_for_regression.py | 1.86 kB | changed to save and load in a directory | 12 months ago |
| test_details_data_processing.py | 4.33 kB | added a test | 12 months ago |
| test_integration.py | 1.96 kB | fixed test_streamlit_app_runs | 12 months ago |
| test_regression.py | 1.26 kB | added todo for test | 12 months ago |
| test_result_data_processing.py | 1.66 kB | Added organization to dataframe | 12 months ago |