FrogMini-14B-2510-MLX-4bit

4-bit MLX quantization of microsoft/FrogMini-14B-2510 — Microsoft's 14B debugging/software-engineering finetune of Qwen3-14B (SFT on debugging trajectories; ~45% pass@1 on SWE-bench Verified). Converted with mlx-lm for Apple Silicon.

Other quantizations: 4-bit (this) · 8-bit

📚 Part of the Frog (SWE/debugging) MLX collection.

Reasoning model: emits <think>...</think> before the final answer.

Precision MLX affine 4-bit, group 64
Size 7.8 GB
Base arch Qwen3-14B (Qwen3) · 64K context
License MIT

Usage

mlx_lm.server --model pipenetwork/FrogMini-14B-2510-MLX-4bit --port 8080
from mlx_lm import load, generate
model, tok = load("pipenetwork/FrogMini-14B-2510-MLX-4bit")

Conversion

mlx_lm.convert --hf-path microsoft/FrogMini-14B-2510 --mlx-path <out> -q --q-bits 4 --q-group-size 64

Converted by pipenetwork. Original model & MIT license by Microsoft; not affiliated.

Downloads last month
13
Safetensors
Model size
15B params
Tensor type
BF16
·
U32
·
MLX
Hardware compatibility
Log In to add your hardware

4-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for pipenetwork/FrogMini-14B-2510-MLX-4bit

Finetuned
Qwen/Qwen3-14B
Quantized
(7)
this model

Collection including pipenetwork/FrogMini-14B-2510-MLX-4bit