Lewis Tunstall's picture

Lewis Tunstall PRO

lewtun

·

https://lewtun.github.io/blog/

AI & ML interests

LLMs, LLMs, LLMs

Recent Activity

updated a Space 25 minutes ago

open-r1/open-r1-eval-leaderboard

updated a Space about 1 hour ago

open-r1/open-r1-eval-leaderboard

updated a Space about 1 hour ago

open-r1/open-r1-eval-leaderboard

View all activity

Organizations

lewtun's activity

upvoted a paper 3 days ago

Open-Reasoner-Zero: An Open Source Approach to Scaling Up Reinforcement Learning on the Base Model

Paper • 2503.24290 • Published 3 days ago • 52

upvoted a paper 4 days ago

Understanding R1-Zero-Like Training: A Critical Perspective

Paper • 2503.20783 • Published 8 days ago • 25

upvoted a paper 9 days ago

SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild

Paper • 2503.18892 • Published 10 days ago • 27

upvoted a paper 12 days ago

Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't

Paper • 2503.16219 • Published 14 days ago • 46

upvoted a paper 14 days ago

Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning

Paper • 2503.07572 • Published 24 days ago • 40

upvoted an article 14 days ago

Article

Open R1: How to use OlympicCoder locally for coding?

15 days ago

• 56

upvoted a paper 16 days ago

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Paper • 2503.14476 • Published 16 days ago • 112

upvoted a paper about 1 month ago

Small Models Struggle to Learn from Strong Reasoners

Paper • 2502.12143 • Published Feb 17 • 33

upvoted a collection about 2 months ago

🧠 Reasoning datasets

Datasets with reasoning traces for math and code released by the community • 20 items • Updated 3 days ago • 118

upvoted a paper about 2 months ago

Competitive Programming with Large Reasoning Models

Paper • 2502.06807 • Published Feb 3 • 69

upvoted a collection about 2 months ago

OpenR1-Math

Dataset and SFT model distilled from DeepSeek-R1. Check out our blog post for more details: https://huggingface.co/blog/open-r1/update-2 • 3 items • Updated 23 days ago • 7

upvoted a paper about 2 months ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 216