Loubna Ben Allal

loubnabnl

https://loubnabnl.github.io/

AI & ML interests

SmolLMs, ML for code, data

Recent Activity

upvoted a collection 9 days ago

Posts 4

Post

3499

Making SmolLM2 reproducible: open-sourcing our training & evaluation toolkit 🛠️ https://github.com/huggingface/smollm/

- Pre-training code with nanotron
- Evaluation suite with lighteval
- Synthetic data generation using distilabel (powers our new SFT dataset HuggingFaceTB/smoltalk)
- Post-training scripts with TRL & the alignment handbook
- On-device tools with llama.cpp for summarization, rewriting & agents

Apache 2.0 licensed. V2 pre-training data mix coming soon!

Which other tools should we add next?

Articles 9

Article

286

Open R1: Update #3

Collections 5

Papers 10

arxiv:2504.05299

arxiv:2502.02737

arxiv:2406.17557

arxiv:2405.18392

Spaces 6

Smol Playground

Zero Gpu

selfcheck

Nt3awnu Map

Diff Visualizer

Compare two texts and highlight differences

The Stack Bot

Models 53

loubnabnl/SmolLM2-135M-Instruct-template2

Text Generation • Updated Feb 18 • 3

loubnabnl/Qwen-Math-1.5B-Bespoke-H4-5e5-4k-ep3

Text Generation • Updated Jan 25 • 4

loubnabnl/Qwen-Math-1.5B-Instruct-Bespoke-H4-3e5-4k-ep3

Text Generation • Updated Jan 25 • 5

loubnabnl/Qwen-Math-1.5B-Instruct-Bespoke-H4-5e5-4k-ep3

Text Generation • Updated Jan 25 • 4

loubnabnl/Qwen-Math-1.5B-Bespoke-H4-1e4-4k-ep3

Text Generation • Updated Jan 25 • 6

loubnabnl/Qwen-Math-1.5B-Instruct-Bespoke-H4-1e4-4k-ep3

Text Generation • Updated Jan 25 • 3

loubnabnl/Llama-8B-Bespoke-H4-GBS500k-lr-3e-5

Text Generation • Updated Jan 25 • 4

loubnabnl/Llama-8B-Instruct-Bespoke-H4-GBS500k-lr5e-5

Text Generation • Updated Jan 25 • 4

loubnabnl/Llama-8B-Instruct-Bespoke-H4-GBS500k-lr1e-4

Text Generation • Updated Jan 25 • 6

loubnabnl/Llama-8B-Instruct-Bespoke-H4-GBS250k-lr2e-5

Text Generation • Updated Jan 25 • 3

Datasets 100

loubnabnl/evals_csv

Viewer • Updated Mar 3 • 920 • 23

loubnabnl/135M-examples

Updated Feb 9 • 22

loubnabnl/mmlu-evals-smollm-360m

Viewer • Updated Dec 22, 2024 • 1 • 16 • 1

loubnabnl/code_data

Viewer • Updated Dec 22, 2024 • 1k • 26

loubnabnl/english-web-100k

Viewer • Updated Dec 22, 2024 • 100k • 39

loubnabnl/generations_dataset_sysprompt

Viewer • Updated Sep 20, 2024 • 41 • 86

loubnabnl/gens-360M-temp7-v2

Viewer • Updated Aug 18, 2024 • 41 • 28

loubnabnl/gens-360M-v2

Viewer • Updated Aug 17, 2024 • 41 • 23

loubnabnl/gens-135M-v2

Viewer • Updated Aug 17, 2024 • 41 • 45 • 1

loubnabnl/generations_dataset

Viewer • Updated Aug 17, 2024 • 40 • 37