Sylvain Lesage PRO

severo

AI & ML interests

Maintainer of 🤗 Datasets Server. Specific interests: data visualization, geospatial data.

severo's activity

upvoted an article 5 days ago

How we leveraged distilabel to create an Argilla 2.0 Chatbot

28
upvoted an article 11 days ago

Enhancing Search Capabilities for Non-English Datasets in the Dataset Viewer

By asoria
4
upvoted an article 12 days ago

Experimenting with Automatic PII Detection on the Hub using Presidio

19
upvoted an article 14 days ago

Announcing New Dataset Search Features

17
upvoted 2 articles about 1 month ago

How to directly access 150k+ Hugging Face Datasets with DuckDB and query using GPT-4o

By chilijung
10
upvoted 3 articles about 2 months ago

FiftyOne Computer Vision Datasets Come to the Hugging Face Hub

By jamarks
12

Wikipedia's Treasure Trove: Advancing Machine Learning with Diverse Data

By frimelle
12
upvoted 2 articles 2 months ago

Synthetic data: save money, time and carbon with open source

37
upvoted 5 articles 3 months ago

⚗️ 🧑🏼‍🌾 Let's grow some Domain Specific Datasets together

28

Post-OCR-Correction: 1 billion words dataset of automated OCR correction by LLM

12

Releasing Youtube-Commons: a massive open corpus for conversational and multimodal data

20

Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models

41

Text2SQL using Hugging Face Dataset Viewer API and Motherduck DuckDB-NSQL-7B

22