filtering the samples

#4
by ehartford - opened

how did you choose the 10k samples?

We sampled uniformly across each data source category. The data source categories are detailed here: https://huggingface.co/datasets/teknium/OpenHermes-2.5?row=5
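For anyone who wants to try something similar, here is a minimal sketch of one way to read "sampled uniformly across each category": take an equal share of rows from every category, chosen at random within it. This is an illustration, not necessarily the exact procedure used for this dataset. The function name `sample_uniform_by_category`, the `key="category"` field name, and the seed are all assumptions for the example, and the toy rows stand in for the real data.

```python
import random
from collections import defaultdict


def sample_uniform_by_category(rows, total, key="category", seed=0):
    """Pick up to `total` rows, split evenly across the values of `key`.

    Hypothetical helper: `key` is an assumed field name, not confirmed
    by the dataset authors.
    """
    rng = random.Random(seed)

    # Group rows by their category value.
    groups = defaultdict(list)
    for row in rows:
        groups[row[key]].append(row)
    if not groups:
        return []

    # Equal quota per category; small categories contribute all they have.
    per_group = total // len(groups)
    picked = []
    for name in sorted(groups):
        members = groups[name]
        k = min(per_group, len(members))
        picked.extend(rng.sample(members, k))
    return picked


# Toy data standing in for the real rows.
toy_rows = (
    [{"category": "a", "id": i} for i in range(50)]
    + [{"category": "b", "id": i} for i in range(30)]
    + [{"category": "c", "id": i} for i in range(5)]
)
subset = sample_uniform_by_category(toy_rows, total=30)
```

Note that with an equal quota, categories smaller than the quota come up short, so the result can contain fewer than `total` rows unless the leftover is redistributed to the larger categories.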

aravindputrevu changed discussion status to closed
