Pierre-Carl Langlais's picture

Pierre-Carl Langlais

Pclanglais

·

Dorialexander

AI & ML interests

Open data & open LLMs

Recent Activity

updated a model about 14 hours ago

PleIAs/350m_fiction

updated a model 1 day ago

PleIAs/pleias_wikidata_2

published a model 1 day ago

PleIAs/350m_fiction

View all activity

Organizations

Pclanglais's activity

upvoted 2 papers 4 months ago

UnifiedCrawl: Aggregated Common Crawl for Affordable Adaptation of LLMs on Low-Resource Languages

Paper • 2411.14343 • Published Nov 21, 2024 • 7

Toxicity of the Commons: Curating Open-Source Pre-Training Data

Paper • 2410.22587 • Published Oct 29, 2024 • 10

upvoted an article 7 months ago

Article

SmolLM - blazingly fast and remarkably powerful

Jul 16, 2024

• 335

upvoted a collection 9 months ago

Common Pile

Datasets in the Common Pile. • 25 items • Updated Oct 29, 2024 • 5

upvoted a collection 12 months ago

OpenCulture

A multilingual dataset of public domain books and newspapers. • 27 items • Updated Nov 6, 2024 • 124