Qwen3-1.7B post-trained with rejection-sampling fine-tuning (thinking preserved) for mathematical reasoning. Final answers are wrapped in \boxed{}.
Chat template
Files info
Base model