math-think-s2-qwen3-4b

Light-R1 stage2 think SFT, continued from math-think-s1-v2. cutoff=24576, 1 epoch, lr=5e-6.

Downloads last month
-
Safetensors
Model size
4B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for modrill/math-think-s2-qwen3-4b

Finetuned
(319)
this model