Daniel Vila PRO

dvilasuero

AI & ML interests

RLHF, RLAIF, DPO, data, data, data

Articles

Organizations

dvilasuero's activity

upvoted 2 articles 11 days ago
view article
Article

Can we create pedagogically valuable multi-turn synthetic datasets from Cosmopedia?

6
upvoted an article 15 days ago
view article
Article

🧑‍⚖️ "Replacing Judges with Juries" using distilabel

14
upvoted an article 19 days ago
view article
Article

⚗️ 🧑🏼‍🌾 Let's grow some Domain Specific Datasets together

25