Commit History

Fix TruthfulQA NaN scores to 0
bb17be3

Clémentine commited on

refacto part 1
2a5f9fb

Clémentine commited on

add new evals to the leaderboard
e3aaf53

Nathan Habib commited on

token for checking gated base models
f3cda22

Clémentine commited on

Fix BibTex author ordering (#342)
216309b

clefourrier HF staff lewtun HF staff commited on

fix disapearing models
280033c

Nathan Habib commited on

Merge branch 'main' of https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard
0f4fbd6

Nathan Habib commited on

fix model display when fething metadata
624b3c8

Nathan Habib commited on

reorg to simplify nav in code base
6e56e0d

Clémentine commited on

should update index in collection as it goes
c212cb7

Clémentine commited on

Creating functions for plotting results over time (#295)
f2bc0a5

clefourrier HF staff chriscanal commited on

update collection path
36bf18d

Clémentine commited on

req test
06acefd

Clémentine commited on

added automatic update of the best LLM models
e295ac3

Clémentine commited on

reformat files, put metadata in request files
adb0416

Nathan Habib commited on

updated GPTQ display!
5491f2d

Clémentine commited on

Update src/display_models/model_metadata_type.py
ed118a6

clefourrier HF staff commited on

fix model search
ef5b51c

Clémentine commited on

Added restrictor on model cards and licenses
b93d1b1

Clémentine commited on

Fix search bar by not filtered models with unknown model type
f485a37

Nathan Habib commited on

flagging AIDC-ai-business models
2ef734a

Nathan Habib commited on

updated caching to include size of models
5933808

Clémentine commited on

Updated model info to get number of parameters in almost all cases, even without safetensors
301c384

Clémentine commited on

Fix merge
5da3025

Nathan Habib commited on

Merge branch 'main' of https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard
bc44e71

Nathan Habib commited on

use cache for model metadata
0799cf8

Nathan Habib commited on

fixes display for model params that are 0
6e79cea

Clémentine commited on

type updated
1d49827

Clémentine commited on

fix list of finished models to include models waiting for new eval
eed1ccd

Nathan Habib commited on

Fix sorting of model files by date, added extra fields if we need the info later
5228101

Clémentine commited on

Added rate limiting system to the leaderboard to prevent abuse
3777786

Clémentine commited on

flagged gaodrew/gaodrew-gorgonzola-13b
77c51de

clefourrier HF staff commited on

fix bug in metadata display
e0b891a

Clémentine commited on

fix of the model type display
9e0f1e6

Clémentine commited on

Update src/display_models/model_metadata_type.py
6d1329a

clefourrier HF staff commited on

Flagging TigerResearch/tigerbot-7b-sft-v1
ff09f56

clefourrier HF staff commited on

fix submit different revisions
49a4ed6

Clémentine commited on

Flagging Fredithefish/ReasonixPajama-3B-HF
bbd101d

clefourrier HF staff commited on

Update src/display_models/model_metadata_type.py
8541f99

clefourrier HF staff commited on

Cleaned and refactored the code, improved filtering, added selection of deleted models
8c49cb6

Clémentine commited on

do not display models with an empty metric result
ba25d90

Clémentine commited on

Set to 0 if metric not found (#220)
c6b775f

osanseviero Xenova HF staff commited on

added 'forbidden models' submission, to allow orgs to request their models to not be submitted in case of contamination
ed1fdef

Clémentine commited on

simplified header text
6fefae4

Clémentine commited on

Flagged model per discussion
6e039c4

Clémentine commited on

removed need for tokens in the leaderboard + removed skull in flagged models
a40c960

Clémentine commited on

Adding flagging systemi, removing changelog
699e8ff

Clémentine commited on

Adding link to detailed results and evals (#203)
6254b87

clefourrier HF staff commited on