Spaces:
Sleeping
Sleeping
Commit History
Read evals code
595e24c
meg-huggingface
commited on
Unhooking backend while I recompute results
b266265
meg-huggingface
commited on
Trying to change spacing of metric options
65dc8d1
meg-huggingface
commited on
Changing all tasks to bias-relevant tasks; creating backend.
a1ce55b
meg-huggingface
commited on
UI
275d535
meg-huggingface
commited on
UI
b14e11d
meg-huggingface
commited on
UI
130a6d2
meg-huggingface
commited on
UI stuff
1075b83
meg-huggingface
commited on
UI stuff
3393abb
meg-huggingface
commited on
UI stuff
3a48629
meg-huggingface
commited on
UI stuff
37ad431
meg-huggingface
commited on
Stretching out the size of the results
29ae773
meg-huggingface
commited on
Adding debug print statements
313cc30
meg-huggingface
commited on
Adding CPU support (float32) and some additional comments.
971bce4
meg-huggingface
commited on
Merge and debugging submissione error
1e11692
meg-huggingface
commited on
Merge
c3873b8
meg-huggingface
commited on
fix
ff8b19d
Clémentine
commited on
debug
f20e4c0
Clémentine
commited on
debug
1fb88dc
Clémentine
commited on
debug
d54df1a
Clémentine
commited on
removed last restart
daf60ae
Clémentine
commited on
simplified calls
50df158
Clémentine
commited on
now with a functionning backend
1ffc326
Clémentine
commited on
fix
1257fc3
Clémentine
commited on
updated leaderboard
efeee6d
Clémentine
commited on
Simplified leaderboard v0
9833cdb
Clémentine
commited on
adding pull back
d084b26
Clémentine
commited on
simplified some parts of the code + updated requirements
9d22eee
Clémentine
commited on
make faster thanks to no concurrency limit
d4aa996
Clémentine
commited on
fix order of request file vs request file list, to avoid resubmitting issues
976f398
Clémentine
commited on
cache
4ff9eef
Clémentine
commited on
update for caching
395eff6
Clémentine
commited on
simplify launcher + remove dataframe warning on boolean columns
ab6f548
Clémentine
commited on
add model architecture as column
3dfaf22
Clémentine
commited on
Try concurrency management
bb149ba
Clémentine
commited on
fix
be0d7e4
Clémentine
commited on
Refactor 2 - added plotting back
b1a1395
Clémentine
commited on
Update app.py
a163e5c
fix col width
fc1e99b
Clémentine
commited on
refacto style + rate limit
df66f6e
Clémentine
commited on
adding collections back
ae85651
Clémentine
commited on
refacto part 1
2a5f9fb
Clémentine
commited on
add new evals to the leaderboard
e3aaf53
Nathan Habib
commited on
add safefail for when we cannot download datasets, will simply restart the space
26286b2
Nathan Habib
commited on
token for checking gated base models
f3cda22
Clémentine
commited on
Merge branch 'main' of https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard
0f4fbd6
Nathan Habib
commited on
reorg to simplify nav in code base
6e56e0d
Clémentine
commited on
Creating functions for plotting results over time (#295)
f2bc0a5
added automatic update of the best LLM models
e295ac3
Clémentine
commited on