Commit History

sort results by date
b521fe9

djstrong commited on

change bele task
b5aa7e1

djstrong commited on

revert to 0
d488d58

djstrong commited on

print missing metadata models
45fa708

djstrong commited on

add perplexity
0d2a785

djstrong commited on

add metadata and filters
ed33da8

djstrong commited on

remove dtype
ccd6234

djstrong commited on

remove trust remote code
92530b5

djstrong commited on

speakleash
1be4fc9

djstrong commited on

model name links
e03efff

djstrong commited on

change model name
ea6148c

djstrong commited on

spichlerz org
2a37ba0

djstrong commited on

disable not working filters
45d02c6

djstrong commited on

more info
3c5ea13

djstrong commited on

clickable condition
1889818

djstrong commited on

n-shot filter
d2d2329

djstrong commited on

add n-shot param
3bb301b

djstrong commited on

add 5-shot
1f30b67

djstrong commited on

0-shot description
d6e3be2

djstrong commited on

only 0-shot
a8630b1

djstrong commited on

simplify model names
b15949c

djstrong commited on

now with a functionning backend
1ffc326

Clémentine commited on

update read
943f952

Clémentine commited on

fixs
314f91a

Clémentine commited on

Simplified leaderboard v0
9833cdb

Clémentine commited on

simplified some parts of the code + updated requirements
9d22eee

Clémentine commited on

add model architecture as column
3dfaf22

Clémentine commited on

Refactor 2 - added plotting back
b1a1395

Clémentine commited on

Fix requirements for mistral models - to change once transformers gets updated.
002172c

Clémentine commited on

fix col width
fc1e99b

Clémentine commited on

refacto style + rate limit
df66f6e

Clémentine commited on

Fix TruthfulQA NaN scores to 0
bb17be3

Clémentine commited on

refacto part 1
2a5f9fb

Clémentine commited on