Andrea Soria

asoria

AI & ML interests

Maintainer of 🤗Datasets: Data processing

Articles

Organizations

asoria's activity

upvoted an article about 17 hours ago
view article
Article

Announcing New Dataset Search Features

13
upvoted an article 29 days ago
view article
Article

How to directly access 150k+ Hugging Face Datasets with DuckDB and query using GPT-4o

By chilijung
10
upvoted 2 articles about 2 months ago
view article
Article

Synthetic dataset generation techniques: generating custom sentence similarity data

13
view article
Article

Synthetic data: save money, time and carbon with open source

35
upvoted an article 2 months ago
view article
Article

🦙⚗️ Using Llama3 and distilabel to build fine-tuning datasets

64
upvoted an article 3 months ago
view article
Article

Text2SQL using Hugging Face Dataset Viewer API and Motherduck DuckDB-NSQL-7B

22
upvoted 2 articles 3 months ago
view article
Article

It's raining diffusion personalization techniques☔️🎭🖼️

By linoyts
16
view article
Article

DuckDB: run SQL queries on 50,000+ datasets on the Hugging Face Hub

3