
Dataset for models confirmed to have training data contaminated with evaluation data

#214

by CoreyMorris - opened Aug 22, 2023

Discussion

CoreyMorris

Aug 22, 2023

This would be helpful for folks building on the Hugging Face evaluation data (see https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard/discussions/174). For now, manual removal is easy because there are just a handful of flagged models, but it would be nice not to have to track this myself and instead have a dataset I can pull from that lists which models I should remove. If this wouldn't fit well into the Hugging Face workflow, or they do not want to do it for some reason, I can upload a dataset.
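As a rough illustration of the removal step described above, here is a minimal sketch. It assumes the flagged models are available as a plain set of model names and that the evaluation results are a list of dicts with a `model` key. The names `FLAGGED_MODELS` and `drop_flagged`, the model names, and the scores are all hypothetical placeholders, not anything the leaderboard provides.

```python
# Hypothetical set of flagged model names (placeholder, not the real list).
FLAGGED_MODELS = {"example-org/contaminated-model"}


def drop_flagged(results, flagged=FLAGGED_MODELS):
    """Return only the result rows whose model name is not flagged.

    Model names are compared case-insensitively.
    """
    flagged_lower = {name.lower() for name in flagged}
    return [row for row in results if row["model"].lower() not in flagged_lower]


# Placeholder rows; the scores are made up for illustration only.
results = [
    {"model": "example-org/clean-model", "score": 1.0},
    {"model": "example-org/contaminated-model", "score": 2.0},
]

clean_results = drop_flagged(results)
```

With a shared list of flagged models, the set above would be loaded from that list instead of being hard-coded.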

clefourrier

Hugging Face H4 org Aug 23, 2023

Hi @CoreyMorris !
We won't go for an automatic process because it could allow people to flag models for the wrong reasons (e.g., flagging a competitor's model).

However, we maintain the list manually here, so every discussion opened will lead to an addition to the file.

CoreyMorris

Aug 23, 2023

Gotcha. That list works. I'll go ahead and close this issue.

CoreyMorris changed discussion status to closed Aug 23, 2023
