Commit History

Update src/display/about.py
8a6bfdc
verified

ofermend committed on

Update src/display/about.py
02cd86f
verified

ofermend committed on

Update src/display/about.py
56492c3
verified

ofermend committed on

Update src/display/about.py
6472dd8
verified

ofermend committed on

Update src/display/about.py
2a8e044
verified

ofermend committed on

Update src/display/about.py
b92e0da
verified

ofermend committed on

Updated bibtex
418a002
verified

minseokbae committed on

Updated bibtex
31b8757
verified

minseokbae committed on

Added bibtex
5ead597
verified

minseokbae committed on

Updated bibtex citation
bac5383
verified

minseokbae committed on

Update src/display/about.py
e2aca33
verified

ofermend committed on

Update src/display/about.py
3c0cb66
verified

ofermend committed on

modified about.py
818ee3d

Minseok Bae committed on

Modified about.py so that it displays (%) in columns.
5bcc476

Minseok Bae committed on

Fixed the leaderboard filtering functionality. Modified filter_models() function in app.py/
1f26f6c

Minseok Bae committed on

modified the evaluation pipelines.
2c24f05

Minseok Bae committed on

Added citations
b46b972

Minseok Bae committed on

Updated about.py
dbcffd4

Minseok Bae committed on

Edited README and added reproducibility functionality in main_backend.py
f0b90cf

Minseok Bae committed on

modified read_evals.py
c3e9147

Minseok Bae committed on

Refine the code style
156ef43

Minseok Bae committed on

Implemented litellm pipeline
2864204

Minseok Bae committed on

Edited README and removed error-rate metric
404587d

Minseok Bae committed on

modified is_model_on_hub()
3b66490

Minseok Bae committed on

changed back to TOKEN
0c85a8e

Minseok Bae committed on

changed to HF_TOKEN
a9a1c18

Minseok Bae committed on

modified check_validity.py and added sample dataset to test functionality
099e4e2

Minseok Bae committed on

Integrated backend pipelines - error occurs during model submission. (Debugging needed).
58b9de9

Minseok Bae committed on

Modified for hallucination evaluation task
d7b7dc6

Minseok Bae committed on

Update src/display/about.py
0baf5c4

ofermend committed on

update read
943f952

Clémentine committed on

fixs
314f91a

Clémentine committed on

updated leaderboard
efeee6d

Clémentine committed on

Simplified leaderboard v0
9833cdb

Clémentine committed on

simplified some parts of the code + updated requirements
9d22eee

Clémentine committed on

Added check on tokenizer to prevent submissions which won't run
7302987

Clémentine committed on

Update benchmark count and fix typo (`inetuning->finetuning`) (#395)
7abc6a7

clefourrier (HF staff) and alvarobartt (HF staff) committed on

fix order of request file vs request file list, to avoid resubmitting issues
976f398

Clémentine committed on

cache
4ff9eef

Clémentine committed on

update for caching
395eff6

Clémentine committed on

add model architecture as column
3dfaf22

Clémentine committed on

Simplify About
eaace79

Clémentine committed on

Refactor 2 - added plotting back
b1a1395

Clémentine committed on

fix value error in param size
ccefec9

Clémentine committed on

Fix requirements for mistral models - to change once transformers gets updated.
002172c

Clémentine committed on

fix col width
fc1e99b

Clémentine committed on

refacto style + rate limit
df66f6e

Clémentine committed on

Fix TruthfulQA NaN scores to 0
bb17be3

Clémentine committed on

refacto part 1
2a5f9fb

Clémentine committed on

add new evals to the leaderboard
e3aaf53

Nathan Habib committed on