math-think-s1-v2-qwen3-4b

Light-R1 stage 1 thinking SFT. Training settings: cutoff length 24576, 2 epochs, learning rate 1e-5, packing enabled.
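
To illustrate what "packing on" with a cutoff of 24576 can mean in practice, here is a minimal sketch of greedy first-fit packing over tokenized sample lengths. The card does not name the training framework or its packing strategy, so the function `pack_sequences`, the constant `CUTOFF_LEN`, and the first-fit rule are illustrative assumptions, not the method actually used for this model.

```python
from typing import List

# Taken from the card: cutoff=24576 (assumed here to be counted in tokens).
CUTOFF_LEN = 24576


def pack_sequences(lengths: List[int], cutoff: int = CUTOFF_LEN) -> List[List[int]]:
    """Hypothetical helper: group sample indices so that the summed
    token count of each group does not exceed `cutoff`.

    Uses greedy first-fit: each sample goes into the first existing
    group with enough room, otherwise it starts a new group.
    """
    groups: List[List[int]] = []   # sample indices per packed sequence
    totals: List[int] = []         # running token count per packed sequence
    for idx, length in enumerate(lengths):
        # Samples longer than the cutoff are treated as truncated to it.
        length = min(length, cutoff)
        for g, total in enumerate(totals):
            if total + length <= cutoff:
                groups[g].append(idx)
                totals[g] += length
                break
        else:
            # No existing group had room, so open a new one.
            groups.append([idx])
            totals.append(length)
    return groups


if __name__ == "__main__":
    # Five samples with made-up token lengths.
    print(pack_sequences([20000, 5000, 4000, 30000, 576]))
    # prints [[0, 2, 4], [1], [3]]
```

Packing reduces padding waste when sample lengths vary widely, at the cost of needing attention masking or position resets between packed samples, which this sketch does not cover.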

Safetensors · Model size: 4B params · Tensor type: BF16

Model repository: modrill/math-think-s1-v2-qwen3-4b