Commits · lmms-lab/LiveBench

Refactor init_leaderboard function to update data outputs to dataframe and improve dropdown UI

ce8a069

pufanyi committed on Jul 14, 2024

Refactor init_leaderboard function to handle multiple subsets, improve column selection and hiding, and include Dataset Version in filter_columns

83431d1

pufanyi committed on Jul 14, 2024

Refactor init_leaderboard function to handle multiple subsets, improve column selection and hiding, and include Dataset Version in filter_columns

35850bf

pufanyi committed on Jul 14, 2024

Refactor auto_eval_column_dict to use "str" instead of "markdown" for the "Model Name" column

d969d3e

pufanyi committed on Jul 14, 2024

Refactor auto_eval_column_dict to set "Dataset Version" column as non-nullable

cb5cde2

pufanyi committed on Jul 14, 2024

Refactor get_leaderboard_df to handle multiple subsets and improve column selection and hiding

303626f

pufanyi committed on Jul 14, 2024

Refactor dataset version column name to lowercase in auto_eval_column_dict

f1d5836

pufanyi committed on Jul 14, 2024

Refactor init_leaderboard function to handle multiple subsets and improve column selection and hiding

09c7b10

pufanyi committed on Jul 14, 2024

Refactor load_dataset to include split parameter in populate.py

95e674a

pufanyi committed on Jul 14, 2024

Refactor get_leaderboard_df to handle multiple subsets and improve column selection and hiding

d9f262c

pufanyi committed on Jul 14, 2024

Refactor get_leaderboard_df to handle multiple subsets and improve column selection and hiding

4c839ed

pufanyi committed on Jul 14, 2024

chore: Refactor get_leaderboard_df to handle multiple subsets in populate.py

88477a4

pufanyi committed on Jul 14, 2024

chore: Update app.py to include select_columns and hide_columns in init_leaderboard function

37b74a1

pufanyi committed on Jul 14, 2024

chore: Update app.py to include select_columns and hide_columns in init_leaderboard function

ab7ee2d

pufanyi committed on Jul 14, 2024

chore: Round numeric columns to two decimal places in get_leaderboard_df

7660cbc

pufanyi committed on Jul 14, 2024

chore: Update envs.py with EVAL_REQUESTS_PATH_BACKEND and EVAL_RESULTS_PATH_BACKEND

2f420b7

pufanyi committed on Jul 14, 2024

chore: Remove commented out code for model information in utils.py

903180b

pufanyi committed on Jul 14, 2024

chore: Update auto_eval_column_dict to use "Total" instead of "Overall" for the Overall column

15d3941

pufanyi committed on Jul 14, 2024

chore: Update model name in auto_eval_column_dict

d47aa6d

pufanyi committed on Jul 14, 2024

chore: Update search columns in app.py to include model and license names

94d4dbb

pufanyi committed on Jul 14, 2024

chore: Update envs.py with EVAL_REQUESTS_PATH_BACKEND and EVAL_RESULTS_PATH_BACKEND

3c62a69

pufanyi committed on Jul 14, 2024

chore: Update Tasks enum values in about.py

046ddc7

pufanyi committed on Jul 14, 2024

Update GOOGLE_SHEET_ID in envs.py

93dabac

pufanyi committed on Jul 10, 2024

chore: Remove commented out code for model information in utils.py

d598d7d

pufanyi committed on Jul 10, 2024

chore: Remove commented out code for model information in utils.py

65654bf

pufanyi committed on Jul 10, 2024

chore: Update page title to "LiveBench"

8336bbd

pufanyi committed on Jul 9, 2024

chore: Update about page title to "Live Bench"

24c1f06

pufanyi committed on Jul 9, 2024

Revert "Update repository references in envs.py"

ce61fc8

pufanyi committed on Jul 9, 2024

Update repository references in envs.py

1d340cf

pufanyi committed on Jul 9, 2024

Update src/envs.py

adad63e
verified

clefourrier HF staff committed on Jul 9, 2024

added leaderboard component to simplify main script

8b28d2b

Clémentine committed on Jul 3, 2024

doc

c1b8a96

Clémentine committed on Apr 11, 2024

simplified the template

24622c4

Clémentine committed on Apr 11, 2024

CPU, TOKEN, env variables (#4)

55cc480
verified

clefourrier HF staff, meg HF staff committed on Jan 22, 2024

Update src/submission/check_validity.py

6eb8bfd

clefourrier HF staff committed on Jan 3, 2024

made token a requirement

f982b8e

Clémentine committed on Nov 23, 2023

test

f0298e1

Clémentine committed on Nov 23, 2023

fix

c15e77e

Clémentine committed on Nov 22, 2023

removed quantization to simplify

b899767

Clémentine committed on Nov 22, 2023

now with a functionning backend

1ffc326

Clémentine committed on Nov 22, 2023

update read

943f952

Clémentine committed on Nov 21, 2023

fixs

314f91a

Clémentine committed on Nov 21, 2023

updated leaderboard

efeee6d

Clémentine committed on Nov 21, 2023

Simplified leaderboard v0

9833cdb

Clémentine committed on Nov 21, 2023

simplified some parts of the code + updated requirements

9d22eee

Clémentine committed on Nov 21, 2023

Added check on tokenizer to prevent submissions which won't run

7302987

Clémentine committed on Nov 21, 2023

Update benchmark count and fix typo (`inetuning->finetuning`) (#395)

7abc6a7

clefourrier HF staff, alvarobartt HF staff committed on Nov 21, 2023

fix order of request file vs request file list, to avoid resubmitting issues

976f398

Clémentine committed on Nov 16, 2023

cache

4ff9eef

Clémentine committed on Nov 16, 2023

update for caching

395eff6

Clémentine committed on Nov 16, 2023
