Commit History
Update src/display/about.py
3c0cb66
verified
modified about.py
818ee3d
Minseok Bae
committed on
Modified about.py so that it displays (%) in columns.
5bcc476
Minseok Bae
committed on
Fixed the leaderboard filtering functionality. Modified filter_models() function in app.py/
1f26f6c
Minseok Bae
committed on
modified the evaluation pipelines.
2c24f05
Minseok Bae
committed on
Added citations
b46b972
Minseok Bae
committed on
Updated about.py
dbcffd4
Minseok Bae
committed on
Edited README and added reproducibility functionality in main_backend.py
f0b90cf
Minseok Bae
committed on
modified read_evals.py
c3e9147
Minseok Bae
committed on
Refine the code style
156ef43
Minseok Bae
committed on
Implemented litellm pipeline
2864204
Minseok Bae
committed on
Edited README and removed error-rate metric
404587d
Minseok Bae
committed on
modified is_model_on_hub()
3b66490
Minseok Bae
committed on
changed back to TOKEN
0c85a8e
Minseok Bae
committed on
changed to HF_TOKEN
a9a1c18
Minseok Bae
committed on
modified check_validity.py and added sample dataset to test functionality
099e4e2
Minseok Bae
committed on
Integrated backend pipelines - error occurs during model submission. (Debugging needed).
58b9de9
Minseok Bae
committed on
Modified for hallucination evaluation task
d7b7dc6
Minseok Bae
committed on
Update src/display/about.py
0baf5c4
update read
943f952
Clémentine
committed on
fixs
314f91a
Clémentine
committed on
updated leaderboard
efeee6d
Clémentine
committed on
Simplified leaderboard v0
9833cdb
Clémentine
committed on
simplified some parts of the code + updated requirements
9d22eee
Clémentine
committed on
Added check on tokenizer to prevent submissions which won't run
7302987
Clémentine
committed on
Update benchmark count and fix typo (`inetuning->finetuning`) (#395)
7abc6a7
fix order of request file vs request file list, to avoid resubmitting issues
976f398
Clémentine
committed on
cache
4ff9eef
Clémentine
committed on
update for caching
395eff6
Clémentine
committed on
add model architecture as column
3dfaf22
Clémentine
committed on
Simplify About
eaace79
Clémentine
committed on
Refactor 2 - added plotting back
b1a1395
Clémentine
committed on
fix value error in param size
ccefec9
Clémentine
committed on
Fix requirements for mistral models - to change once transformers gets updated.
002172c
Clémentine
committed on
fix col width
fc1e99b
Clémentine
committed on
refacto style + rate limit
df66f6e
Clémentine
committed on
Fix TruthfulQA NaN scores to 0
bb17be3
Clémentine
committed on
refacto part 1
2a5f9fb
Clémentine
committed on
add new evals to the leaderboard
e3aaf53
Nathan Habib
committed on
token for checking gated base models
f3cda22
Clémentine
committed on
Fix BibTex author ordering (#342)
216309b
fix disapearing models
280033c
Nathan Habib
committed on
Merge branch 'main' of https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard
0f4fbd6
Nathan Habib
committed on
fix model display when fething metadata
624b3c8
Nathan Habib
committed on
reorg to simplify nav in code base
6e56e0d
Clémentine
committed on
should update index in collection as it goes
c212cb7
Clémentine
committed on
Creating functions for plotting results over time (#295)
f2bc0a5
update collection path
36bf18d
Clémentine
committed on
req test
06acefd
Clémentine
committed on