leaderboard / app.py

Commit History

Read evals code
595e24c

meg-huggingface committed on

Unhooking backend while I recompute results
b266265

meg-huggingface committed on

Trying to change spacing of metric options
65dc8d1

meg-huggingface committed on

Changing all tasks to bias-relevant tasks; creating backend.
a1ce55b

meg-huggingface committed on

UI
275d535

meg-huggingface committed on

UI
b14e11d

meg-huggingface committed on

UI
130a6d2

meg-huggingface committed on

UI stuff
1075b83

meg-huggingface committed on

UI stuff
3393abb

meg-huggingface committed on

UI stuff
3a48629

meg-huggingface committed on

UI stuff
37ad431

meg-huggingface committed on

Stretching out the size of the results
29ae773

meg-huggingface committed on

Adding debug print statements
313cc30

meg-huggingface committed on

Adding CPU support (float32) and some additional comments.
971bce4

meg-huggingface committed on

Merge and debugging submission error
1e11692

meg-huggingface committed on

Merge
c3873b8

meg-huggingface committed on

fix
ff8b19d

Clémentine committed on

debug
f20e4c0

Clémentine committed on

debug
1fb88dc

Clémentine committed on

debug
d54df1a

Clémentine committed on

removed last restart
daf60ae

Clémentine committed on

simplified calls
50df158

Clémentine committed on

now with a functioning backend
1ffc326

Clémentine committed on

fix
1257fc3

Clémentine committed on

updated leaderboard
efeee6d

Clémentine committed on

Simplified leaderboard v0
9833cdb

Clémentine committed on

adding pull back
d084b26

Clémentine committed on

simplified some parts of the code + updated requirements
9d22eee

Clémentine committed on

make faster thanks to no concurrency limit
d4aa996

Clémentine committed on

fix order of request file vs request file list, to avoid resubmitting issues
976f398

Clémentine committed on

cache
4ff9eef

Clémentine committed on

update for caching
395eff6

Clémentine committed on

simplify launcher + remove dataframe warning on boolean columns
ab6f548

Clémentine committed on

add model architecture as column
3dfaf22

Clémentine committed on

Try concurrency management
bb149ba

Clémentine committed on

fix
be0d7e4

Clémentine committed on

Refactor 2 - added plotting back
b1a1395

Clémentine committed on

Update app.py
a163e5c

clefourrier committed on

fix col width
fc1e99b

Clémentine committed on

refacto style + rate limit
df66f6e

Clémentine committed on

adding collections back
ae85651

Clémentine committed on

refacto part 1
2a5f9fb

Clémentine committed on

add new evals to the leaderboard
e3aaf53

Nathan Habib committed on

add safefail for when we cannot download datasets, will simply restart the space
26286b2

Nathan Habib committed on

token for checking gated base models
f3cda22

Clémentine committed on

Merge branch 'main' of https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard
0f4fbd6

Nathan Habib committed on

reorg to simplify nav in code base
6e56e0d

Clémentine committed on

Creating functions for plotting results over time (#295)
f2bc0a5

clefourrier, chriscanal committed on

added automatic update of the best LLM models
e295ac3

Clémentine committed on

reformat files, put metadata in request files
adb0416

Nathan Habib committed on