Spaces:

open-llm-leaderboard
/

open_llm_leaderboard

Running on CPU Upgrade

App Files Files Community

1102

UltraFeedback contamination with TruthfulQA

#361

by natolambert - opened Nov 9, 2023

Discussion

natolambert

Nov 9, 2023

Zephyr-7b-beta and a model we're building at AI2 are trained on this dataset which has TruthfulQA prompts https://huggingface.co/datasets/openbmb/UltraFeedback. Not sure the right way to filter these models, but it likely gives a not realistic boost in performance.

clefourrier

Open LLM Leaderboard org Nov 9, 2023

Hi!
Good to know, thank you for your comment - can you make a list of the models you'd like to flag for having TruthfulQA in their training set? (Plus ideally the sources for all models?)

clefourrier

Open LLM Leaderboard org Nov 26, 2023

Closing for inactivity

clefourrier changed discussion status to closed Nov 26, 2023

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment