Qwen3-VL Text Encoders β€” INT8

INT8 quantized versions of the Qwen3-VL text encoders, for use in ComfyUI workflows (Boogu, Ideogram 4, and others that use Qwen3-VL as a text encoder).

Files

File Source ConvRot Group Size Notes
qwen3vl_4b_int8_convrot.safetensors true bf16 256 Best quality, full-strength ConvRot
qwen3vl_8b_int8_convrot.safetensors true bf16 16 One intermediate dim (4304) limits group size
qwen3vl_8b_int8_v2.safetensors true bf16 β€” (no ConvRot) Plain tensor-wise INT8

All quantized with silveroxides/convert_to_quant, --int8 --scaling_mode tensor --simple, embedding/lm_head/norm layers excluded and kept in BF16.

Important Note

⚠️ Do not use a text encoder requantized from an already-FP8-quantized source (FP8 β†’ INT8). Double quantization causes visible color/saturation drift, especially noticeable on few-step distilled models like Boogu Turbo. These files were quantized directly from true BF16 source weights (Comfy-Org/Qwen3-VL), avoiding that issue entirely.

Usage

Load via the standard Load CLIP node in ComfyUI, paired with the matching diffusion model (Boogu, Ideogram 4, etc.). Native INT8 support is available in recent ComfyUI builds; older builds need an INT8-compatible custom node.

Credits

πŸ’¬ Discord: discord.gg/CJv5wceJaN β˜• Ko-fi: ko-fi.com/winnougan

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support