FrogBoss-32B-2510-MLX-4bit

4-bit MLX quantization of microsoft/FrogBoss-32B-2510 — Microsoft's 32B debugging/software-engineering finetune of Qwen3-32B (SFT on debugging trajectories; ~45% pass@1 on SWE-bench Verified). Converted with mlx-lm for Apple Silicon.

Other quantizations: 4-bit (this) · 8-bit

📚 Part of the Frog (SWE/debugging) MLX collection.

Precision MLX affine 4-bit, group 64
Size 17 GB
Base arch Qwen3-32B (Qwen3) · 64K context
License MIT

Usage

mlx_lm.server --model pipenetwork/FrogBoss-32B-2510-MLX-4bit --port 8080
from mlx_lm import load, generate
model, tok = load("pipenetwork/FrogBoss-32B-2510-MLX-4bit")

Conversion

mlx_lm.convert --hf-path microsoft/FrogBoss-32B-2510 --mlx-path <out> -q --q-bits 4 --q-group-size 64

Converted by pipenetwork. Original model & MIT license by Microsoft; not affiliated.

Downloads last month
15
Safetensors
Model size
33B params
Tensor type
BF16
·
U32
·
MLX
Hardware compatibility
Log In to add your hardware

4-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for pipenetwork/FrogBoss-32B-2510-MLX-4bit

Base model

Qwen/Qwen3-32B
Quantized
(7)
this model

Collection including pipenetwork/FrogBoss-32B-2510-MLX-4bit