Bonsai Image · Ternary 4B — Unpacked FP16 Safetensors

This repo contains FP16 safetensors (Hugging Face diffusers format) of the ternary Bonsai Image 4B model. It exists for users who want to run Bonsai Image with stock diffusers or other frameworks that don't yet support our low-bit packs natively. The ternary kernels live in MLX (Apple Silicon, 2-bit out of the box) and the gemlite low-bit GEMM stack (CUDA).
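As a rough illustration of the stock-diffusers path, the sketch below loads this repo with the generic `DiffusionPipeline` loader in FP16. It is a minimal sketch under assumptions, not an official recipe: it assumes the repo's pipeline class is resolvable by your installed diffusers version, that the pipeline accepts a `prompt` argument and returns an object with `.images` (as most text-to-image pipelines do), and that a CUDA device is available. The helper names `load_pipeline` and `generate` are hypothetical.

```python
# Repo id as shown on this model page.
REPO_ID = "prism-ml/bonsai-image-ternary-4B-unpacked"


def load_pipeline(repo_id=REPO_ID, device="cuda"):
    """Load the unpacked FP16 weights with stock diffusers (assumed to be supported)."""
    # Imported lazily so this module can be imported without torch/diffusers installed.
    import torch
    from diffusers import DiffusionPipeline

    # torch_dtype=torch.float16 keeps the weights in the FP16 format they are stored in.
    pipe = DiffusionPipeline.from_pretrained(repo_id, torch_dtype=torch.float16)
    return pipe.to(device)


def generate(pipe, prompt, out_path="bonsai.png"):
    """Run one text-to-image call and save the first image.

    Assumes the pipeline takes `prompt=` and returns `.images`; check the
    pipeline class for this model if the call signature differs.
    """
    image = pipe(prompt=prompt).images[0]
    image.save(out_path)
    return out_path
```

To try it, call `pipe = load_pipeline()` followed by `generate(pipe, "a bonsai tree on a desk")`. On Apple Silicon, passing `device="mps"` may work instead of `"cuda"`, though this path gets none of the low-bit speedups.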

We strongly recommend using the optimized low-bit packs instead. The ternary format is where the Bonsai Image gains come from: a 6.4× reduction in transformer footprint, sub-iPhone deployment, and ~5× faster inference than the stock FP16 pipeline on Apple Silicon. This unpacked FP16 version is full-size and provides none of those advantages.
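To put the 6.4× figure in perspective, here is a back-of-the-envelope calculation. It is only a sketch: it assumes the transformer holds roughly the nominal 4 billion parameters implied by the "4B" name (the exact count is not stated here) and that the FP16 baseline uses 2 bytes per weight. The constant and function names are illustrative.

```python
N_PARAMS = 4_000_000_000   # assumption: nominal "4B" parameter count
REDUCTION = 6.4            # transformer footprint reduction quoted above
FP16_BITS = 16             # bits per weight in the unpacked FP16 version


def footprint_gb(n_params, bits_per_weight):
    """Storage in gigabytes (1 GB = 1e9 bytes) for n_params weights."""
    return n_params * bits_per_weight / 8 / 1e9


fp16_gb = footprint_gb(N_PARAMS, FP16_BITS)   # size of the unpacked transformer
packed_gb = fp16_gb / REDUCTION               # size implied by the 6.4x reduction
effective_bits = FP16_BITS / REDUCTION        # average bits per weight implied

print(f"FP16 transformer:   ~{fp16_gb:.2f} GB")
print(f"Packed transformer: ~{packed_gb:.2f} GB")
print(f"Effective bits per weight: ~{effective_bits:.1f}")
```

Under these assumptions the unpacked transformer is on the order of 8 GB, while the packed one is a little over 1 GB, or about 2.5 bits per weight on average. That is somewhat above the 2-bit packing mentioned for MLX, so part of the footprint presumably goes to data other than the packed weights themselves; the breakdown is not given here.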

For the optimized ternary release packs (recommended):

For the smaller-footprint variant:

See the Bonsai Image Demo repo for one-command setup of either variant on Mac, Linux, or Windows.
