view post Post 1810 You can just ask things 🗣️"show me messages in the coding category that are in the top 10% of reward model scores"Download really high quality instructions from the Llama3.1 405B synthetic dataset 🔥 argilla/magpie-ultra-v1.0 See translation 🔥 6 6 👀 5 5 👍 1 1 + Reply
Tulu 3 Datasets Collection All datasets released with Tulu 3 -- state of the art open post-training recipes. • 32 items • Updated 28 days ago • 62
PixMo Collection A set of vision-language datasets built by Ai2 and used to train the Molmo family of models. Read more at https://molmo.allenai.org/blog • 9 items • Updated 28 days ago • 50