Instructions to use SceneWorks/sdxl-base-mlx with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- MLX
How to use SceneWorks/sdxl-base-mlx with MLX:
# Download the model from the Hub pip install huggingface_hub[hf_xet] huggingface-cli download --local-dir sdxl-base-mlx SceneWorks/sdxl-base-mlx
- Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- LM Studio
Stable Diffusion XL base 1.0 β MLX pre-quantized tiers
Pre-quantized, packed-load tiers of
stabilityai/stable-diffusion-xl-base-1.0
for on-device Apple-Silicon inference with SceneWorks / mlx-gen
(the sdxl generator). Each tier is a self-contained diffusers turnkey snapshot (U-Net + both
CLIP text encoders + VAE + tokenizers + scheduler + model_index.json) that loads directly β no
in-app quantization pass, no dense transient.
Dual CLIP-L + OpenCLIP-bigG text encoders, real classifier-free guidance + negative prompt, sdxl-family LoRA support. ~30 steps at guidance 7.0, native 1024Γ1024.
Tiers
| dir | precision | what's quantized |
|---|---|---|
q4/ (default) |
group-wise affine Q4, group size 64 | U-Net Linears + both CLIP encoders |
q8/ |
group-wise affine Q8, group size 64 | U-Net Linears + both CLIP encoders |
bf16/ |
dense (full-precision master) | nothing β verbatim source mirror |
The VAE stays dense (f32) in every tier (the SDXL VAE is int8/fp16-unstable). Convolutions,
GroupNorms, and the CLIP token/position embeddings also stay dense; only the true Linear projections
are packed. Quantization is byte-identical to mlx-gen's load-time nn.quantize (bf16 cast, group
64).
Usage
use mlx_gen::{LoadSpec, WeightsSource, Quant};
let spec = LoadSpec::new(WeightsSource::Dir("β¦/sdxl-base-mlx/q4".into())).with_quant(Quant::Q4);
let g = mlx_gen::load("sdxl", &spec)?;
License
openrail++ (CreativeML Open RAIL++-M) β inherited from the source model
stabilityai/stable-diffusion-xl-base-1.0. See LICENSE.
Quantized
Model tree for SceneWorks/sdxl-base-mlx
Base model
stabilityai/stable-diffusion-xl-base-1.0