Sylvain Lesage PRO

severo

AI & ML interests

Maintainer of 🤗 Datasets Server. Specific interests: data visualization, geospatial data.

Articles

Organizations

severo's activity

upvoted an article 3 days ago
upvoted an article 4 days ago
view article
Article

Synthetic data: save money, time and carbon with open source

21
upvoted an article 19 days ago
view article
Article

⚗️ 🧑🏼‍🌾 Let's grow some Domain Specific Datasets together

25
upvoted an article 22 days ago
view article
Article

Post-OCR-Correction: 1 billion words dataset of automated OCR correction by LLM

10
upvoted 3 articles about 1 month ago
view article
Article

Releasing Youtube-Commons: a massive open corpus for conversational and multimodal data

20
view article
Article

Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models

17
view article
Article

Text2SQL using Hugging Face Dataset Viewer API and Motherduck DuckDB-NSQL-7B

20