math-nothink-strip-qwen3-4b

Full-parameter SFT of Qwen3-4B-Base for math reasoning (short CoT, no thinking blocks).

Ablation: same Light-R1 questions as math think, with `` blocks stripped.

  • Dataset: math_nothink_strip (75,649 samples)
  • Template: qwen3_nothink
  • Epochs: 2 | LR: 3e-5 | Cutoff: 8192
Downloads last month
19
Safetensors
Model size
4B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for modrill/math-nothink-strip-qwen3-4b

Finetuned
(321)
this model