Daniel Vila

dvilasuero

AI & ML interests

RLHF, RLAIF, DPO, data, data, data

Posts

Today is a huge day in Argilla’s history. We couldn’t be more excited to share this with the community: we’re joining Hugging Face!

We’re embracing a larger mission and becoming part of a brilliant and kind team with a shared vision about the future of AI.

Over the past year, we’ve been collaborating with Hugging Face on countless projects: becoming a launch partner of Docker Spaces, empowering the community to clean Alpaca translations into Spanish and other languages, launching argilla/notus-7b-v1 building on Zephyr’s learnings, running the Data is Better Together initiative with hundreds of community contributors, and releasing argilla/OpenHermesPreferences, one of the largest open preference-tuning datasets.

After more than 2,000 Slack messages and over 60 people collaborating for over a year, it already felt like we were part of the same team, pushing in the same direction. After a week of the smoothest transition you can imagine, we’re now the same team.

To those of you who’ve been following us, this won’t be a huge surprise, but it will be a big deal in the coming months. This acquisition means we’ll double down on empowering the community to build and collaborate on high quality datasets, we’ll bring full support for multimodal datasets, and we’ll be in a better place to collaborate with the Open Source AI community. For enterprises, this means that the Enterprise Hub will unlock highly requested features like single sign-on and integration with Inference Endpoints.

As a founder, I am proud of the Argilla team. We're now part of something bigger and a larger team, but with the same values, culture, and goals. Grateful to have shared this journey with my beloved co-founders Paco and Amélie.

Finally, huge thanks to the Chief Llama Officer @osanseviero for sparking this and being such a great partner during the acquisition process.

Would love to answer any questions you have so feel free to add them below!

🔥 Community and Data Quality Matter More For Alignment

A recipe to replicate SPIN (Self-Play Fine-Tuning) with 30x less data:

🗣️ 50K samples in the original SPIN vs 1.8K prompts curated by the 350+ amazing DIBT contributors.
⚗️ Distillation of Mistral Large instead of OpenAI
🙌 Open data & code with ⚗️distilabel
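
If you're curious what the recipe boils down to on the data side, here is a minimal sketch under stated assumptions: the two generate_* helpers and the example prompt are placeholders standing in for Mistral Large (the teacher) and the model being fine-tuned; the real implementation is the distilabel pipeline in the repo linked below.

```python
# Rough sketch of the idea, not the actual distilabel pipeline from the repo.
# SPIN-style pairs: the "chosen" response is distilled from a strong teacher
# (here Mistral Large), the "rejected" one is sampled from the model being
# fine-tuned. Both generate_* helpers below are placeholders.
from datasets import Dataset

prompts = [
    "Explain self-play fine-tuning in one paragraph.",  # stand-in for the 1.8K curated DIBT prompts
]

def generate_chosen(prompt: str) -> str:
    # Placeholder: query the teacher model (e.g. Mistral Large via its API).
    return f"<teacher response to: {prompt}>"

def generate_rejected(prompt: str) -> str:
    # Placeholder: sample from the current model; at each SPIN iteration the
    # freshly fine-tuned model becomes the generator of the "rejected" side.
    return f"<self-generated response to: {prompt}>"

pairs = Dataset.from_list(
    [
        {"prompt": p, "chosen": generate_chosen(p), "rejected": generate_rejected(p)}
        for p in prompts
    ]
)
print(pairs[0])
```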

SPIN Paper:
Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models (2401.01335)
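
For context, my reading of the paper's core objective: at iteration t, the updated model is trained to separate the human response y from a response y′ generated by the previous iterate, using a DPO-like logistic loss:

$$
\mathcal{L}_{\text{SPIN}}(\theta;\theta_t)=\mathbb{E}_{x\sim q,\;y\sim p_{\mathrm{data}}(\cdot\mid x),\;y'\sim p_{\theta_t}(\cdot\mid x)}\left[\ell\!\left(\lambda\log\frac{p_\theta(y\mid x)}{p_{\theta_t}(y\mid x)}-\lambda\log\frac{p_\theta(y'\mid x)}{p_{\theta_t}(y'\mid x)}\right)\right]
$$

with ℓ(z) = log(1 + e^{−z}), so each round is roughly DPO with self-generated rejections and the previous iterate as the reference model.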

SPIN DIBT Collection with datasets and models:
argilla/dibt-prompt-collective-spin-65ef59062518776024395fc3

Repo:
https://github.com/argilla-io/distilabel-spin-dibt

Joint work with the amazing DIBT community 👇
@aashish1904, @flozi00, @sayhan, @munish0838, @0-hero, @dvilasuero, @eren23, @davanstrien, @ahnz, @BlackKakapo, @kitano-o, @mmhamdy, @sdiazlor, @Stopwolf, @gabrielmbmb, @tculler91, @plaguss, @ignacioct, @Hugi-R, @davidberenstein1957, @Korla, @alvarobartt, @Hugs4Llamas, @Sumandora, @nataliaElv, @jfcalvo, @Averill, @steventrouble, @vasilis, @aeros93, @kayyshf, @thomasgauthier, @jeromebas, @Ameeeee, @ayoubelmhamdi, @TuringsSolutions, @efels, @Haleyok, @abrazador, @emessy, @Nindaleth, @burtenshaw, @vicgalle, @CortexPE, @casey-martin, @Leire-aguirre-eguiluz, @mrfakename, @Portias600kNeurons, @nathaliepett, @Filippo