open_llm_leaderboard / src /leaderboard

Commit History

wip
0c7ef71

Clémentine commited on

Update src/leaderboard/read_evals.py
3b554b5

clefourrier HF staff commited on

Incorrectly tagged merges are now flagged
90fa47e

Clémentine commited on

Added checkbox for merges
b762711

Clémentine commited on

flag model
991b9e1

Nathan Habib commited on

flag model
511d367

Nathan Habib commited on

adding merge check - super slow but at least info is displayed
20b060e

Clémentine commited on

flag models
c841f87

Nathan Habib commited on

flag models
425be57

Nathan Habib commited on

flag models
d93b3d2

Nathan Habib commited on

flag models
42f5749

Nathan Habib commited on

flag models
71834c1

Nathan Habib commited on

flag models
c1d0f7f

Nathan Habib commited on

nathan-flagged-models-vis (#478)
460ecf2

clefourrier HF staff commited on

added flag
783ccc5

Clémentine commited on

added tigerbot models to do not submit per authors request
202d26e

Clémentine commited on

flagging tiger models
e629df0

Nathan Habib commited on

simplified some parts of the code + updated requirements
9d22eee

Clémentine commited on

add model architecture as column
3dfaf22

Clémentine commited on

Refactor 2 - added plotting back
b1a1395

Clémentine commited on

Fix requirements for mistral models - to change once transformers gets updated.
002172c

Clémentine commited on

fix col width
fc1e99b

Clémentine commited on

refacto style + rate limit
df66f6e

Clémentine commited on

Fix TruthfulQA NaN scores to 0
bb17be3

Clémentine commited on

refacto part 1
2a5f9fb

Clémentine commited on