Stable Diffusion XL base 1.0 β€” MLX pre-quantized tiers

Pre-quantized, packed-load tiers of stabilityai/stable-diffusion-xl-base-1.0 for on-device Apple-Silicon inference with SceneWorks / mlx-gen (the sdxl generator). Each tier is a self-contained diffusers turnkey snapshot (U-Net + both CLIP text encoders + VAE + tokenizers + scheduler + model_index.json) that loads directly β€” no in-app quantization pass, no dense transient.

Dual CLIP-L + OpenCLIP-bigG text encoders, real classifier-free guidance + negative prompt, sdxl-family LoRA support. ~30 steps at guidance 7.0, native 1024Γ—1024.

Tiers

dir precision what's quantized
q4/ (default) group-wise affine Q4, group size 64 U-Net Linears + both CLIP encoders
q8/ group-wise affine Q8, group size 64 U-Net Linears + both CLIP encoders
bf16/ dense (full-precision master) nothing β€” verbatim source mirror

The VAE stays dense (f32) in every tier (the SDXL VAE is int8/fp16-unstable). Convolutions, GroupNorms, and the CLIP token/position embeddings also stay dense; only the true Linear projections are packed. Quantization is byte-identical to mlx-gen's load-time nn.quantize (bf16 cast, group 64).

Usage

use mlx_gen::{LoadSpec, WeightsSource, Quant};
let spec = LoadSpec::new(WeightsSource::Dir("…/sdxl-base-mlx/q4".into())).with_quant(Quant::Q4);
let g = mlx_gen::load("sdxl", &spec)?;

License

openrail++ (CreativeML Open RAIL++-M) β€” inherited from the source model stabilityai/stable-diffusion-xl-base-1.0. See LICENSE.

Downloads last month

-

Downloads are not tracked for this model. How to track
MLX
Hardware compatibility
Log In to add your hardware

Quantized

Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Model tree for SceneWorks/sdxl-base-mlx

Finetuned
(1199)
this model