Lewis Tunstall's picture

In a Training Loop 🔄

Lewis Tunstall PRO

lewtun

huggingface

·

https://lewtun.github.io/blog/

AI & ML interests

LLMs, LLMs, LLMs

Recent Activity

updated a dataset 1 day ago

lewtun/ml-intern-sessions

liked a Space 2 days ago

huggingface/ml-intern-api

published a bucket 3 days ago

lewtun/qwen3-capybara-sft-static-fce858-bucket

View all activity

Organizations

buckets 39

lewtun/qwen3-capybara-sft-static-fce858-bucket

lewtun/huggingface-static-72676e-bucket

lewtun/huggingface-static-0ac062-bucket

lewtun/capybara-sft-static-1f4453-bucket

lewtun/qwen3-capybara-sft-static-f35000-bucket

lewtun/qwen3-capybara-sft-static-579e01-bucket

View 39 buckets

Posts 8

Post

5062

Introducing OlympicCoder: a series of open reasoning models that can solve olympiad-level programming problems 🧑‍💻

- 7B open-r1/OlympicCoder-7B
- 32B open-r1/OlympicCoder-32B

We find that OlympicCoder models outperform Claude 3.7 Sonnet, as well as others over 100x larger 💪

Together with the models, we are releasing:

📊CodeForces-CoTs: new dataset of code problems from the most popular competitive coding platform, with R1 traces in C++ and Python open-r1/codeforces-cots

🏆 IOI'2024: a new benchmark of VERY hard programming problems where even frontier models struggle to match human performance open-r1/ioi

For links to the models and datasets, check out our latest progress report from Open R1: https://huggingface.co/blog/open-r1/update-3

Articles 41

Article

83

The Open Source Community is backing OpenEnv for Agentic RL

View all Articles

Collections 6

View 6 collections

Papers 11

arxiv:2504.11354

arxiv:2504.05299

arxiv:2503.07572

arxiv:2502.02737

spaces 99

Capybara Sft Static 1f4453

View real-time project metrics with an interactive dashboard

Trackio Dashboard

Show interactive tracking visualizations

Trackio Dashboard

Qwen3 Capybara Sft Static F35000

Explore your data with an interactive tracking dashboard

Trackio Dashboard

Trackio Dashboard

Display interactive visualizations of tracked data

models 321

lewtun/qwen3-0.6b-capybara-smoke

Text Generation • 0.6B • Updated 9 days ago • 61

lewtun/qwen3-0.6b-capybara

Text Generation • 0.6B • Updated 9 days ago • 49

lewtun/qwen3-0.6b-capybara-1step

Text Generation • 0.6B • Updated 10 days ago • 61

lewtun/qwen3-0.6b-angrygiraffe-sft

Text Generation • 0.6B • Updated 13 days ago • 64

lewtun/qwen3-4b-hermes-tooluse

Text Generation • 4B • Updated 13 days ago • 48

lewtun/qwen3-0.6b-sft-capybara

Text Generation • 0.6B • Updated May 12 • 88

lewtun/smollm2-1.7b-capybara-sft

lewtun/qwen3-0.6b-openthoughts3-sft

lewtun/smollm2-135m-capybara-csv

Text Generation • 0.1B • Updated May 11 • 80 • 1

lewtun/smollm2-135m-capybara-jsonl

Text Generation • 0.1B • Updated May 11 • 82

View 321 models

datasets 96

lewtun/ml-intern-sessions

Traces • Updated 1 day ago • 439 • 4.56k • 3

lewtun/capybara-25-20260507

Viewer • Updated May 7 • 25 • 25

lewtun/capybara-25-20260506

Viewer • Updated May 6 • 25 • 15

lewtun/capybara-25

Viewer • Updated May 6 • 25 • 23

lewtun/capybara-100-2026-05-05

Viewer • Updated May 5 • 100 • 16

lewtun/capybara-100-test-2026-05-05

Updated May 5 • 9

lewtun/openthoughts-100

Updated May 5 • 36

lewtun/Capybara-100

Viewer • Updated May 5 • 100 • 38

lewtun/running-dashboard-data

Viewer • Updated May 3 • 16 • 7

lewtun/dolci-think-sft-6400

Viewer • Updated Mar 11 • 6.4k • 18

View 96 datasets