---
license: other
license_name: sacla
license_link: >-
https://huggingface.co/stabilityai/stable-diffusion-3.5-large/blob/main/LICENSE.md
base_model:
- stabilityai/stable-diffusion-3.5-large
base_model_relation: quantized
---
## Overview
These models are made to work with [stable-diffusion.cpp](https://github.com/leejet/stable-diffusion.cpp) release [master-ac54e00](https://github.com/leejet/stable-diffusion.cpp/releases/tag/master-ac54e00) onwards. Support for other inference backends is not guaranteed.
Quantized using [this PR](https://github.com/leejet/stable-diffusion.cpp/pull/447).
Regular K-quants do not work properly with SD3.5-Large models because around 90% of the weights are in tensors whose shape doesn't match the 256-weight superblock size of K-quants, so those tensors can't be quantized this way. Mixing quantization types lets us take advantage of the better fidelity of K-quants to some extent while keeping the model file size relatively small.
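
As a rough illustration of the mixing idea, here is a minimal sketch of a per-tensor type choice. It is not the code from the PR: the function name `pick_type`, the f16 fallback for tensors that fit neither block size, and the example shapes are all assumptions made for illustration. The only facts it relies on are that K-quants group weights in superblocks of 256 and q4_0 uses blocks of 32.

```python
K_SUPERBLOCK = 256  # weights per K-quant superblock
LEGACY_BLOCK = 32   # weights per q4_0 block


def pick_type(shape, k_type="q3_k", fallback="q4_0"):
    """Pick a quantization type for one tensor (hypothetical helper).

    `shape` is assumed to be in ggml order, so shape[0] is the row length
    that gets split into quantization blocks.
    """
    row_len = shape[0]
    if row_len % K_SUPERBLOCK == 0:
        return k_type      # rows split evenly into 256-weight superblocks
    if row_len % LEGACY_BLOCK == 0:
        return fallback    # rows only split evenly into 32-weight blocks
    return "f16"           # assumption: leave anything else unquantized


# Illustrative shapes only, not read from the actual model files.
for shape in [(2432, 2432), (9728, 2432), (100, 7)]:
    print(shape, "->", pick_type(shape))
```

A row length of 2432 is a multiple of 32 but not of 256, so under this sketch such a tensor would get q4_0, while a row length of 9728 (38 × 256) would get the K-quant type.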
## Files:
### Mixed Types:
- [sd3.5_large-q2_k_4_0.gguf](https://huggingface.co/stduhpf/SD3.5-Large-GGUF-mixed-sdcpp/blob/main/sd3.5_large-q2_k_4_0.gguf): Smallest quantization yet. Use this if you can't afford anything bigger.
- [sd3.5_large-q3_k_4_0.gguf](https://huggingface.co/stduhpf/SD3.5-Large-GGUF-mixed-sdcpp/blob/main/sd3.5_large-q3_k_4_0.gguf)
- [sd3.5_large-q4_k_4_0.gguf](https://huggingface.co/stduhpf/SD3.5-Large-GGUF-mixed-sdcpp/blob/main/sd3.5_large-q4_k_4_0.gguf): Exactly the same size as q4_0, but with slightly less degradation. Recommended.
### Legacy types:
- [sd3.5_large-q4_0.gguf](https://huggingface.co/stduhpf/SD3.5-Large-GGUF-mixed-sdcpp/blob/main/legacy/sd3.5_large-q4_0.gguf): Same size as q4_k_4_0. Not recommended (use q4_k_4_0 instead).
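
The identical file size of q4_0 and q4_k_4_0 follows from the block layouts: both q4_0 (18 bytes per 32 weights) and q4_K (144 bytes per 256 weights) come out to 4.5 bits per weight. The sketch below works through that arithmetic using the ggml block sizes. The 10% K-quant fraction is an assumption taken from the rough "around 90%" figure in the overview, and `mixed_bpw` is a hypothetical helper, so the mixed numbers are estimates rather than measured file sizes.

```python
# (bytes per block, weights per block) for the ggml types used here
BLOCKS = {
    "q2_k": (84, 256),
    "q3_k": (110, 256),
    "q4_k": (144, 256),
    "q4_0": (18, 32),
}


def bits_per_weight(qtype):
    """Storage cost of one weight, in bits, for a given quantization type."""
    nbytes, nweights = BLOCKS[qtype]
    return nbytes * 8 / nweights


def mixed_bpw(k_type, k_fraction=0.10, fallback="q4_0"):
    """Estimated average bits per weight for a K-quant / fallback mix.

    k_fraction is the assumed share of weights stored in the K-quant type.
    """
    return (k_fraction * bits_per_weight(k_type)
            + (1 - k_fraction) * bits_per_weight(fallback))


for k_type in ("q2_k", "q3_k", "q4_k"):
    print(f"{k_type}_4_0: ~{mixed_bpw(k_type):.3f} bits per weight")
```

Because q4_K and q4_0 cost the same per weight, the mix stays at 4.5 bits per weight regardless of the fraction assumed.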
## Outputs:
Sorted by model size (Note that q4_0 and q4_k_4_0 are the exact same size)
| Quantization | Robot girl | Text | Cute kitten |
| ------------------ | -------------------------------- | ---------------------------------- | ---------------------------------- |
| q2_k_4_0 | (image: q2_k_4_0) | (image: q2_k_4_0) | (image: q2_k_4_0) |
| q3_k_4_0 | (image: q3_k_4_0) | (image: q3_k_4_0) | (image: q3_k_4_0) |
| q4_0 | (image: q4_0) | (image: q4_0) | (image: q4_0) |
| q4_k_4_0 | (image: q4_k_4_0) | (image: q4_k_4_0) | (image: q4_k_4_0) |
Generated with a modified version of sdcpp with [this PR](https://github.com/leejet/stable-diffusion.cpp/pull/397) applied to enable clip timestep embeddings support.
Text encoders used: q4_k quant of t5xxl, full precision clip_g, and q8 quant of [ViT-L-14-TEXT-detail-improved-hiT-GmP-TE-only-HF](https://huggingface.co/zer0int/CLIP-GmP-ViT-L-14) in place of clip_l.
Full prompts and settings are in the PNG metadata.