Spaces:
Running
Running
Commit History
Refactor init_leaderboard function to handle multiple subsets, improve column selection and hiding, and include Dataset Version in filter_columns
83431d1
Refactor init_leaderboard function to handle multiple subsets, improve column selection and hiding, and include Dataset Version in filter_columns
35850bf
Refactor auto_eval_column_dict to use "str" instead of "markdown" for the "Model Name" column
d969d3e
Refactor auto_eval_column_dict to set "Dataset Version" column as non-nullable
cb5cde2
Refactor get_leaderboard_df to handle multiple subsets and improve column selection and hiding
303626f
Refactor dataset version column name to lowercase in auto_eval_column_dict
f1d5836
Refactor init_leaderboard function to handle multiple subsets and improve column selection and hiding
09c7b10
Refactor load_dataset to include split parameter in populate.py
95e674a
Refactor get_leaderboard_df to handle multiple subsets and improve column selection and hiding
d9f262c
Refactor get_leaderboard_df to handle multiple subsets and improve column selection and hiding
4c839ed
chore: Refactor get_leaderboard_df to handle multiple subsets in populate.py
88477a4
chore: Update app.py to include select_columns and hide_columns in init_leaderboard function
37b74a1
chore: Update app.py to include select_columns and hide_columns in init_leaderboard function
ab7ee2d
chore: Round numeric columns to two decimal places in get_leaderboard_df
7660cbc
chore: Update envs.py with EVAL_REQUESTS_PATH_BACKEND and EVAL_RESULTS_PATH_BACKEND
2f420b7
chore: Remove commented out code for model information in utils.py
903180b
chore: Update auto_eval_column_dict to use "Total" instead of "Overall" for the Overall column
15d3941
chore: Update model name in auto_eval_column_dict
d47aa6d
chore: Update search columns in app.py to include model and license names
94d4dbb
chore: Update envs.py with EVAL_REQUESTS_PATH_BACKEND and EVAL_RESULTS_PATH_BACKEND
3c62a69
chore: Update Tasks enum values in about.py
046ddc7
Update GOOGLE_SHEET_ID in envs.py
93dabac
chore: Remove commented out code for model information in utils.py
d598d7d
chore: Remove commented out code for model information in utils.py
65654bf
chore: Update page title to "LiveBench"
8336bbd
chore: Update about page title to "Live Bench"
24c1f06
Revert "Update repository references in envs.py"
ce61fc8
Update repository references in envs.py
1d340cf
Update src/envs.py
adad63e
verified
added leaderboard component to simplify main script
8b28d2b
Clémentine
commited on
doc
c1b8a96
Clémentine
commited on
simplified the template
24622c4
Clémentine
commited on
CPU, TOKEN, env variables (#4)
55cc480
verified
Update src/submission/check_validity.py
6eb8bfd
made token a requirement
f982b8e
Clémentine
commited on
test
f0298e1
Clémentine
commited on
fix
c15e77e
Clémentine
commited on
removed quantization to simplify
b899767
Clémentine
commited on
now with a functionning backend
1ffc326
Clémentine
commited on
update read
943f952
Clémentine
commited on
fixs
314f91a
Clémentine
commited on
updated leaderboard
efeee6d
Clémentine
commited on
Simplified leaderboard v0
9833cdb
Clémentine
commited on
simplified some parts of the code + updated requirements
9d22eee
Clémentine
commited on
Added check on tokenizer to prevent submissions which won't run
7302987
Clémentine
commited on
Update benchmark count and fix typo (`inetuning->finetuning`) (#395)
7abc6a7
fix order of request file vs request file list, to avoid resubmitting issues
976f398
Clémentine
commited on
cache
4ff9eef
Clémentine
commited on
update for caching
395eff6
Clémentine
commited on