SDXL-Lightning ONNX (WebGPU, fp16)

ONNX build of ByteDance/SDXL-Lightning (4-step) for in-browser inference via onnxruntime-web (WebGPU). Includes the UNet, both text encoders, VAE and tokenizers. UNet is re-sharded into <2 GB parts.

Used by the Generate AI Images extension (local SDXL generation in the browser, no server).

  • Resolution: 1024ร—1024
  • Steps: 4โ€“6, guidance ~1.5
  • I/O: fp32 (fp16 internals)
  • Size: ~6.9 GB

License

Derivative work combining three upstream sources (all permissive; commercial use permitted, subject to the RAIL++-M use restrictions):

The combined work is released under CreativeML Open RAIL++-M (the most restrictive of the three).

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for d0gr/sdxl-lightning-onnx-webgpu

Quantized
(3)
this model