PCM_Weights / README.md
wangfuyun's picture
Update README.md
c578155 verified
|
raw
history blame
No virus
1.28 kB
metadata
library_name: diffusers
pipeline_tag: text-to-image

Phased Consistency Model

LoRA weights of Stable Diffusion XL for fast text-to-image generation.

Important Usage Guidance

  1. Use DDIM or Euler instead of LCM for sampling! When using DDIM, set timestep_spacing="trailing".

  2. The name of each LoRA weights indicates how many inference steps they should be applied.

  3. The name of each LoRA weights indicates whether they are able to use normal CFGs or small CFGs

    • NormalCFG means that model equipped with the LoRA can use CFG value 2-9 for generation. Yet you should adjust the CFG values given the steps you applied. When using fewer steps, you should use smaller CFGs. For example, use CFG 2.5 - 3.5 with 4 four steps and use CFG 3 - 6 with 8 steps. This is because that fewer-step means the model has fewer chance to fix the issues caused by the CFG.
    • SmallCFG means that the model equipped with the LoRA can use CFG value 1-2 for generation.

Note: Just find the normalCFG with 4-step is not working very well. Trying to solve the issue.

[paper] [arXiv] [code] [project page]