Diffusion Single File
comfyui

Anima's INT8 Acceleration Plugin and Advanced Timestep Parameters

#197
by kongbai-84 - opened

The INT8 acceleration plugin can significantly increase generation speed on GPUs with strong INT8 computing power, such as the RTX 30 series, while keeping the quality almost unchanged. When generating with an RTX 3090 GPU, the generation time is reduced to 0.5-0.6x compared to when it is disabled.

Disabled

image (24)

Enabled

image (25)

Original plugin: https://github.com/SparknightLLC/ComfyUI-INT8-Toolkit
forge-neo: https://github.com/kongbai-0421/anima-int8-toolkit

The advanced timestep plugin allows for the configuration of advanced timestep sampling parameters.
https://github.com/kongbai-0421/anima-timestep-advanced

Looking forward to better FP8 support though. I get nowhere near 2x speed when using FP8 quants :(

INT8 only improved performance by 10% (1.66s/it -> 1.49s/it) on my B580, but reduced VRAM usage by 4G.
The overall image quality hasn’t dropped significantly, but there’s a marked difference in the finer details.

Sign up or log in to comment