Qwen3.5-9B-GGUF

File:

  • qwen3.5-9b-mix.gguf

Quantization summary:

  • token_embd/output = Q4_K
  • attn_qkv = Q4_K
  • full-attention attn_q/k/output = Q4_K
  • ssm_alpha/ssm_beta = Q6_K
  • attn_gate = Q4_0
  • ssm_out = Q4_0
  • ffn_down = Q4_0
  • attention-block ffn_gate/up = Q4_K
  • SSM-block ffn_gate/up = Q3_K
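
The per-tensor mix above can be checked against the file itself. The following is a minimal sketch, not the author's tooling: it assumes the `gguf` Python package (from the llama.cpp project) is installed and that tensors follow the usual llama.cpp naming pattern `blk.<n>.<role>.weight`. The helper names `summarize_quant_types` and `read_gguf_tensors` are hypothetical.

```python
from collections import Counter, defaultdict


def summarize_quant_types(tensors):
    """Group (tensor_name, quant_type_name) pairs by tensor role.

    The role is the tensor name with a leading 'blk.<n>.' prefix and a
    trailing '.weight' or '.bias' removed, so 'blk.3.ffn_down.weight'
    becomes 'ffn_down'.
    """
    summary = defaultdict(Counter)
    for name, qtype in tensors:
        parts = name.split(".")
        # Drop the per-block prefix 'blk.<n>' if present.
        if parts[0] == "blk" and len(parts) > 2:
            parts = parts[2:]
        # Drop the trailing 'weight' / 'bias' suffix if present.
        if parts[-1] in ("weight", "bias"):
            parts = parts[:-1]
        summary[".".join(parts)][qtype] += 1
    return {role: dict(counts) for role, counts in summary.items()}


def read_gguf_tensors(path):
    """Return (name, quant_type_name) pairs from a GGUF file.

    Assumption: the 'gguf' package is available (pip install gguf).
    """
    from gguf import GGUFReader

    reader = GGUFReader(path)
    return [(t.name, t.tensor_type.name) for t in reader.tensors]


if __name__ == "__main__":
    # File name taken from the "File:" section of this card.
    pairs = read_gguf_tensors("qwen3.5-9b-mix.gguf")
    for role, counts in sorted(summarize_quant_types(pairs).items()):
        print(f"{role:24s} {counts}")
```

A role that is quantized differently depending on the block type, such as ffn_gate/up here, would show up with more than one quantization type in its counts.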

Size:

  • quant size: 4605.35 MiB
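
As a rough sanity check, the file size can be converted to an average number of bits per weight. This sketch assumes the nominal parameter count of 9B listed for the model; the exact tensor element count is not given here, so the result is only approximate. The function name `bits_per_weight` is illustrative.

```python
MIB = 1024 ** 2  # bytes per MiB


def bits_per_weight(size_mib, n_params):
    """Average bits stored per parameter for a file of size_mib MiB."""
    return size_mib * MIB * 8 / n_params


# 4605.35 MiB is the quant size stated above; 9e9 is the nominal
# parameter count (an assumption, not an exact tensor element count).
bpw = bits_per_weight(4605.35, 9e9)
print(f"{bpw:.2f} bits per weight")
```

With these inputs the average comes out to roughly 4.29 bits per weight, which is consistent with a mix dominated by 4-bit types.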

Model:

  • format: GGUF
  • model size: 9B params
  • architecture: qwen35

Base model:

  • Qwen/Qwen3.5-9B (keyuan01/Qwen3.5-9B-GGUF is a quantization of this model)