Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
l3lab
's Collections
L1
miniCTX
L1
updated
Mar 7
L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning
Upvote
6
l3lab/L1-Qwen-1.5B-Max
Updated
Mar 7
•
3.36k
•
14
l3lab/L1-Qwen-1.5B-Exact
Updated
Apr 7
•
9.64k
•
5
Upvote
6
+2
Share collection
View history
Collection guide
Browse collections