Qwenjamin Franklin

Qwenjamin Franklin 4bit

Qwenjamin Franklin 4bit is the fused Apple Silicon release of the strongest everyday-use branch from the Qwenjamin Franklin workshop line.

It is built from Qwen/Qwen3.5-9B and tuned for compact coding help, stricter JSON and tool behavior, and stronger false-premise correction, while staying local-first in MLX.

If you want the follow-on release with a more expanded model card, see stamsam/Qwenjamin_Franklin_V2 and its compact sibling stamsam/Qwenjamin_Franklin_V2_4bit.

What This Release Is

  • Fused MLX 4-bit model
  • Base lineage: Qwen/Qwen3.5-9B
  • Workshop branch lineage: v14 broad-benchmark daily-driver
  • Best fit: Apple Silicon local use where size and speed matter

Base vs This Model

Scores come from internal workshop evals. They are project-specific and directional, not public leaderboard claims.

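| Eval                        | Base Qwen3.5-9B-MLX-4bit | Qwenjamin Franklin 4bit |
|-----------------------------|--------------------------|-------------------------|
| workbench_local_agent_smoke | 63/100                   | 72/100                  |
| full40                      | 309/400                  | 325/400                 |
| json_hard                   | 15/30                    | 30/30                   |
| parser_gate                 | 2/3, 1/3, 1/3            | 3/3, 3/3, 3/3           |
| code_smoke                  | 95/120                   | 95/120                  |
| false_smoke                 | 102/110                  | 110/110                 |
| tool_schema_canary          | 50/175                   | 106/175                 |
| no_tool_leakage             | 99/100                   | 100/100                 |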
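
Because the evals use different totals, the raw scores are easier to compare as pass rates. The sketch below converts the single-run rows into percentages and percentage-point changes. The helper names `pass_rate` and `deltas` are illustrative and not part of the workshop tooling. `parser_gate` is left out because it reports three separate runs rather than one score.

```python
# Scores copied from the eval table: (eval name, base score, tuned score).
SCORES = [
    ("workbench_local_agent_smoke", "63/100", "72/100"),
    ("full40", "309/400", "325/400"),
    ("json_hard", "15/30", "30/30"),
    ("code_smoke", "95/120", "95/120"),
    ("false_smoke", "102/110", "110/110"),
    ("tool_schema_canary", "50/175", "106/175"),
    ("no_tool_leakage", "99/100", "100/100"),
]


def pass_rate(score: str) -> float:
    """Convert a 'passed/total' string into a percentage."""
    passed, total = score.split("/")
    return 100.0 * int(passed) / int(total)


def deltas(rows):
    """Return (name, base %, tuned %, change in percentage points) per eval."""
    out = []
    for name, base, tuned in rows:
        b, t = pass_rate(base), pass_rate(tuned)
        out.append((name, round(b, 1), round(t, 1), round(t - b, 1)))
    return out


for name, b, t, d in deltas(SCORES):
    print(f"{name:30s} {b:5.1f}% -> {t:5.1f}%  ({d:+.1f} pts)")
```

These percentages are only a restatement of the table and carry the same caveat: they are directional, internal numbers.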

Usage

python -m mlx_lm generate \
  --model stamsam/Qwenjamin_Franklin_4bit \
  --prompt "Return only valid JSON." \
  --max-tokens 256 \
  --temp 0.0
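
To call the same command from a script, one option is a thin wrapper around the CLI. This is a minimal sketch that reuses only the flags shown above. The function names `build_generate_argv` and `run_generate` are hypothetical, and it assumes `mlx_lm` is installed in the current Python environment on an Apple Silicon machine.

```python
import subprocess
import sys


def build_generate_argv(model, prompt, max_tokens=256, temp=0.0):
    """Build the argument list for the mlx_lm generate command shown above."""
    return [
        sys.executable, "-m", "mlx_lm", "generate",
        "--model", model,
        "--prompt", prompt,
        "--max-tokens", str(max_tokens),
        "--temp", str(temp),
    ]


def run_generate(model, prompt, **kwargs):
    """Run the CLI and return its raw standard output as text.

    Assumption: mlx_lm is installed; a non-zero exit raises CalledProcessError.
    """
    argv = build_generate_argv(model, prompt, **kwargs)
    result = subprocess.run(argv, capture_output=True, text=True, check=True)
    return result.stdout
```

A call such as `run_generate("stamsam/Qwenjamin_Franklin_4bit", "Return only valid JSON.")` would then return whatever the CLI prints. That output may include status lines alongside the generated text, so it may need trimming before parsing.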

Notes

  • This is the compact Apple Silicon release.
  • For strict JSON or code-only tasks, use explicit output instructions in the prompt.
  • Verify important outputs before using them in high-stakes workflows.
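
One way to act on the last two notes is to parse every JSON response before anything downstream consumes it. The sketch below uses only the Python standard library. The function name `validate_json_output` and the `required_keys` parameter are illustrative, not part of this release.

```python
import json


def validate_json_output(text, required_keys=()):
    """Return (parsed, errors); `errors` is empty when the output is usable."""
    try:
        parsed = json.loads(text.strip())
    except json.JSONDecodeError as exc:
        # Any non-JSON text around the payload ends up here.
        return None, [f"not valid JSON: {exc.msg} at position {exc.pos}"]

    errors = []
    if required_keys:
        if not isinstance(parsed, dict):
            errors.append("expected a JSON object")
        else:
            errors.extend(
                f"missing key: {key}" for key in required_keys if key not in parsed
            )
    return parsed, errors
```

For example, `validate_json_output(output, required_keys=("name",))` returns the parsed object with an empty error list only when the output is pure JSON and contains a `name` key. A response with extra prose around the JSON is rejected rather than silently repaired.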