|
--- |
|
license: other |
|
license_name: sacla |
|
license_link: >- |
|
https://huggingface.co/stabilityai/stable-diffusion-3.5-large/blob/main/LICENSE.md |
|
base_model: |
|
- stabilityai/stable-diffusion-3.5-large |
|
base_model_relation: quantized |
|
--- |
|
## Overview |
|
These models are made to work with [stable-diffusion.cpp](https://github.com/leejet/stable-diffusion.cpp) release [master-ac54e00](https://github.com/leejet/stable-diffusion.cpp/releases/tag/master-ac54e00) onwards. Support for other inference backends is not guarenteed. |
|
|
|
Quantized using this PR https://github.com/leejet/stable-diffusion.cpp/pull/447 |
|
|
|
Normal K-quants are not working properly with SD3.5-Large models because around 90% of the weights are in tensors whose shape doesn't match the 256 superblock size of K-quants and therefore can't be quantized this way. Mixing quantization types allows us to take adventage of the better fidelity of k-quants to some extent while keeping the model file size relatively small. |
|
|
|
## Files: |
|
|
|
### Mixed Types: |
|
|
|
|
|
- [sd3.5_large-q2_k_4_0.gguf](https://huggingface.co/stduhpf/SD3.5-Large-GGUF-mixed-sdcpp/blob/main/sd3.5_large-q2_k_4_0.gguf): Smallest quantization yet. Use this if you can't afford anything bigger |
|
- [sd3.5_large-q3_k_4_0.gguf](https://huggingface.co/stduhpf/SD3.5-Large-GGUF-mixed-sdcpp/blob/main/sd3.5_large-q3_k_4_0.gguf) |
|
- [sd3.5_large-q4_k_4_0.gguf](https://huggingface.co/stduhpf/SD3.5-Large-GGUF-mixed-sdcpp/blob/main/sd3.5_large-q4_k_4_0.gguf): Exacty same size as q4_0, but with slightly less degradation. Recommended |
|
|
|
### Legacy types: |
|
|
|
- [sd3.5_large-q4_0.gguf](https://huggingface.co/stduhpf/SD3.5-Large-GGUF-mixed-sdcpp/blob/main/legacy/sd3.5_large-q4_0.gguf): Same size as q4_k_4_0, Not recommended (use q4_k_4_0 instead) |
|
|
|
|
|
## Outputs: |
|
|
|
Sorted by model size (Note that q4_0 and q4_k_4_0 are the exact same size) |
|
|
|
| Quantization | Robot girl | Text | Cute kitten | |
|
| ------------------ | -------------------------------- | ---------------------------------- | ---------------------------------- | |
|
| q2_k_4_0 | data:image/s3,"s3://crabby-images/6720a/6720af8c86cb988d31d9e5c4a859f49b13007e91" alt="q2_k_4_0" | data:image/s3,"s3://crabby-images/b6e2e/b6e2e8b6cf043c74375db35ad1468ef8c6930330" alt="q2_k_4_0" | data:image/s3,"s3://crabby-images/96d47/96d47c8aa6e28d866062f248a155fa05615e39d2" alt="q2_k_4_0" | |
|
| q3_k_4_0 | data:image/s3,"s3://crabby-images/03d56/03d5671e40d97dc97696886bf9aa27ce44f7351c" alt="q3_k_4_0" | data:image/s3,"s3://crabby-images/7991e/7991e312addbcde40eab61d4308ee1d5959f3403" alt="q3_k_4_0" | data:image/s3,"s3://crabby-images/c0e4b/c0e4b1c1cbb4159bae48641b9df0b55cbea43530" alt="q3_k_4_0" | |
|
| q4_0 | data:image/s3,"s3://crabby-images/5dcf5/5dcf56955527bbea0b7f712e25cb6f2904219d9a" alt="q4_0" | data:image/s3,"s3://crabby-images/c50b3/c50b3f36a96a0f3c0aa4970ca7450bd12ba2ecba" alt="q4_0" | data:image/s3,"s3://crabby-images/ed2b5/ed2b5afba88f78df9e0d0e9a07212622d1214882" alt="q4_0" | |
|
| q4_k_4_0 | data:image/s3,"s3://crabby-images/0c4fc/0c4fcd246d7841a8384643137a6ac52a4d251f22" alt="q4_k_4_0" | data:image/s3,"s3://crabby-images/6a628/6a628911413a8088974784baa1108ee959e1e75e" alt="q4_k_4_0" | data:image/s3,"s3://crabby-images/db813/db81330312ec70f82f8524537e59507e9aa12cd4" alt="q4_k_4_0" | |
|
|
|
Generated with a modified version of sdcpp with [this PR](https://github.com/leejet/stable-diffusion.cpp/pull/397) applied to enable clip timestep embeddings support. |
|
|
|
Text encoders used: q4_k quant of t5xxl, full precision clip_g, and q8 quant of [ViT-L-14-TEXT-detail-improved-hiT-GmP-TE-only-HF](https://huggingface.co/zer0int/CLIP-GmP-ViT-L-14) in place of clip_l. |
|
|
|
Full prompts and settings in png metadata. |
|
|