Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up
davanstrien 
posted an update Mar 4
Post
Today, we're launching an effort to empower the community to build impactful datasets collectively.

Good data is essential for the open-source AI community. Recently, Argilla and Hugging Face launched Data is Better Together. In less than two weeks, over 350 people ranked over 10k prompts.

Today, we're shifting our focus to help support other community efforts to create datasets using Argilla and Hugging Face Spaces. This workflow means anyone with a Hugging Face account can contribute to a dataset in less than a minute. We want to hear from anyone with ideas for creating important datasets as a community. This could include things like:

- Creating preference data for a language that lacks high-quality preference datasets.
- Building evaluation datasets for a new domain.
- Developing a dataset for a new task.

If you would like to get involved, join us in the #data-is-better-together Discord channel: https://discord.com/channels/879548962464493619/1205128865735770142.

You can read more in this blog post from @dvilasuero and I: https://huggingface.co/blog/community-datasets
In this post