Instructions to use talzoomanzoo/qwen3-4b-base-math-32k with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- PEFT
How to use talzoomanzoo/qwen3-4b-base-math-32k with PEFT:
from peft import PeftModel from transformers import AutoModelForCausalLM base_model = AutoModelForCausalLM.from_pretrained("Qwen/Qwen3-4B-Base") model = PeftModel.from_pretrained(base_model, "talzoomanzoo/qwen3-4b-base-math-32k") - Notebooks
- Google Colab
- Kaggle
Qwen3-4B Base math LoRA (32k)
LoRA adapter from math SFT at 32k context, trained on Qwen/Qwen3-4B-Base.
Checkpoint: step 228 (2 epochs).
Load
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel
base = "Qwen/Qwen3-4B-Base"
adapter = "talzoomanzoo/qwen3-4b-base-math-32k"
tokenizer = AutoTokenizer.from_pretrained(base, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
base, torch_dtype="auto", device_map="auto", trust_remote_code=True
)
model = PeftModel.from_pretrained(model, adapter)
Training
- Base model: Qwen/Qwen3-4B-Base
- Method: LoRA SFT (r=64, alpha=128) via TRL
- Max sequence length: 32768
- Downloads last month
- 1
Model tree for talzoomanzoo/qwen3-4b-base-math-32k
Base model
Qwen/Qwen3-4B-Base