Qwopus3.6-27B-v2 MLX 4bit

4-bit MLX quantization of Jackrong/Qwopus3.6-27B-v2. Group size 64. The standard daily-driver quant for Apple Silicon.

Use

pip install mlx-lm
mlx_lm.generate --model zaydiscold/Qwopus3.6-27B-v2-MLX-4bit \
  --prompt "Explain quantum entanglement" --max-tokens 200 --temp 0.8

Author's temperature recommendation: 0.75–1.0.

The full MLX ladder

Variant Repo
MLX bf16 Qwopus3.6-27B-v2-MLX-bf16
MLX 4bit (this repo) this

More variants (8bit, 6bit, 3bit, group-size sweeps) shipping shortly.

Credits


Mix brought by the NOTORIOUS MLX.

Downloads last month
62
Safetensors
Model size
4B params
Tensor type
BF16
·
U32
·
MLX
Hardware compatibility
Log In to add your hardware

4-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for zaydiscold/Qwopus3.6-27B-v2-MLX-4bit

Quantized
(22)
this model

Collection including zaydiscold/Qwopus3.6-27B-v2-MLX-4bit