RealVisXL V5.0 Lightning β€” MLX pre-quantized tiers

Pre-quantized, packed-load tiers of SG161222/RealVisXL_V5.0_Lightning for on-device Apple-Silicon inference with SceneWorks / mlx-gen (the sdxl generator). Each tier is a self-contained diffusers turnkey snapshot (U-Net + both CLIP text encoders + VAE + tokenizers + scheduler + model_index.json) that loads directly β€” no in-app quantization pass, no dense transient.

A few-step distilled SDXL-Lightning photoreal checkpoint (openrail++, commercial-OK, ungated) β€” a standalone sibling of RealVisXL V5.0 tuned for ~5-step generation, roughly 6Γ— faster than the 30-step base. Runs CFG-free by default; text-to-image only.

Tiers

dir precision what's quantized
q4/ (default) group-wise affine Q4, group size 64 U-Net Linears + both CLIP encoders
q8/ group-wise affine Q8, group size 64 U-Net Linears + both CLIP encoders
bf16/ dense (full-precision master) nothing β€” verbatim source mirror

The VAE stays dense (f32) in every tier (the SDXL VAE is int8/fp16-unstable). Convolutions, GroupNorms, and the CLIP token/position embeddings also stay dense; only the true Linear projections are packed. Quantization is byte-identical to mlx-gen's load-time nn.quantize (bf16 cast, group 64).

Usage

use mlx_gen::{LoadSpec, WeightsSource, Quant};
let spec = LoadSpec::new(WeightsSource::Dir("…/realvisxl-lightning-mlx/q4".into())).with_quant(Quant::Q4);
let g = mlx_gen::load("sdxl", &spec)?;

License

openrail++ β€” inherited from the source model SG161222/RealVisXL_V5.0_Lightning. See LICENSE.

Downloads last month

-

Downloads are not tracked for this model. How to track
MLX
Hardware compatibility
Log In to add your hardware

Quantized

Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Model tree for SceneWorks/realvisxl-lightning-mlx

Finetuned
(2)
this model