Qwen3.5-9B (+ generation_config.json)

Verbatim copy of Qwen/Qwen3.5-9B with the only change being the addition of a generation_config.json carrying both stop ids:

{"eos_token_id": [248046, 248044]}   // <|im_end|>, <|endoftext|>

Upstream Qwen/Qwen3.5-9B ships no generation_config.json, so inference engines (sglang/vLLM) fall back to config.json's eos = <|endoftext|> (248044) and never stop on the chat turn terminator <|im_end|> (248046) — causing runaway generation in multi-turn / tool-use rollouts. The 35B-A3B checkpoint ships this file; 4B/9B do not. This repo restores it so multi-turn generation halts correctly at each turn.

Downloads last month
300
Safetensors
Model size
10B params
Tensor type
BF16
·
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for friday-and-co/Qwen3.5-9B

Finetuned
Qwen/Qwen3.5-9B
Finetuned
(424)
this model