PleIAs

company

AI & ML interests

Open Science LLMs

Recent Activity

carlosrosas published a model 12 days ago

PleIAs/Pleias-SLM-RAG

carlosrosas updated a model 12 days ago

PleIAs/Pleias-SLM-RAG

Pclanglais updated a dataset 26 days ago

PleIAs/Telco-Common-Corpus

Organization Card

PleIAs is a private French AI lab training the next generation of language models for document processing.

PleIAs is committed to open science and has coordinated the release of some of the largest open corpora for pre-training.

For more information, visit our website: https://pleias.fr/

Collections 11

Spaces 7

baguettotron_demo

📜

Vintage OCR Corrector (GPU)

📜

Correct OCR errors in your text

Vintage OCR Corrector (CPU)

📜

Correct OCR errors in text

Finance Commons Explorer

💻

Browse finance datasets on Hugging Face

Reversed-Zotero

📜

Models 31

Datasets 60

PleIAs/Telco-Common-Corpus

Viewer • Updated 26 days ago • 1.78M • 1.16k • 3

PleIAs/French-Science-Commons

Viewer • Updated Jun 10 • 42.6M • 4.48k • 23

PleIAs/BSF_Redline

Viewer • Updated Jun 3 • 1.05M • 471

PleIAs/telecom-knowledge-base

Viewer • Updated May 13 • 4.68M • 40 • 1

PleIAs/SYNTH

Viewer • Updated May 6 • 68M • 6k • 272

PleIAs/common_corpus

Viewer • Updated May 6 • 69.9k • 73.8k • 410

PleIAs/CommonLingua-Train

Viewer • Updated Apr 28 • 2.76M • 73 • 15

PleIAs/Japanese-PD

Viewer • Updated Feb 16 • 1.38M • 504 • 1

PleIAs/Arabic-PD

Viewer • Updated Feb 16 • 221k • 375

PleIAs/verse-wikisource

Preview • Updated Nov 11, 2025 • 46 • 3
