Rio-3.1-Open-30B-MLX-6bit

MLX (Apple Silicon) conversion of prefeitura-rio/Rio-3.1-Open-30B (Qwen3-MoE, 128 experts), quantized to 6-bit. First MLX build of this model.

Quantizations

Part of the Rio-3.1-Open-30B MLX collection.

Variant Notes
8-bit 8-bit · near-lossless
6-bit (this repo) 6-bit · high quality
5-bit 5-bit
4-bit 4-bit · balanced default

Use with mlx-lm

pip install mlx-lm
python -m mlx_lm generate --model pipenetwork/Rio-3.1-Open-30B-MLX-6bit --prompt "Olá, tudo bem?" -m 256

Validation

Smoke-tested locally: loads and generates coherent text.

License

MIT (inherited from base). Quantization config: {"group_size": 64, "bits": 6, "mode": "affine", "model.layers.0.mlp.gate": {"group_size": 64, "bits": 8}, "model.layers.1.mlp.gate": {"group_size": 64, "bits": 8}, "model.layers.2.mlp.gate": {"group_s.

Downloads last month
47
Safetensors
Model size
31B params
Tensor type
BF16
·
U32
·
MLX
Hardware compatibility
Log In to add your hardware

6-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for pipenetwork/Rio-3.1-Open-30B-MLX-6bit

Quantized
(6)
this model

Collection including pipenetwork/Rio-3.1-Open-30B-MLX-6bit