Andrea Soria

asoria

AI & ML interests

Maintainer of 🤗Datasets: Data processing

Articles

Organizations

asoria's activity

upvoted an article 8 days ago
view article
Article

Synthetic dataset generation techniques: generating custom sentence similarity data

11
upvoted an article 17 days ago
view article
Article

Synthetic data: save money, time and carbon with open source

28
upvoted an article 28 days ago
view article
Article

🦙⚗️ Using Llama3 and distilabel to build fine-tuning datasets

55
upvoted an article about 1 month ago
view article
Article

Text2SQL using Hugging Face Dataset Viewer API and Motherduck DuckDB-NSQL-7B

20
upvoted 2 articles about 2 months ago
view article
Article

It's raining diffusion personalization techniques☔️🎭🖼️

By linoyts
16
view article
Article

DuckDB: run SQL queries on 50,000+ datasets on the Hugging Face Hub

2