PCM_Weights / README.md
wangfuyun's picture
Update README.md
4aeee8c verified
|
raw
history blame
1.67 kB
metadata
library_name: diffusers
pipeline_tag: text-to-image

Phased Consistency Model

LoRA weights of Stable Diffusion XL for fast text-to-image generation.

Update

  • [2024.06.03]: Converted all LoRA weights and merge the repo of Stable Diffusion v1-5 and Stable Diffusion XL. Add LCM-Like PCM LoRAs, functioning just like LCM but works better at low-step regime. Note LoRA is not sufficient for one-step generation.

Important Usage Guidance

  1. Use DDIM or Euler instead of LCM for sampling! When using DDIM, set timestep_spacing="trailing".

  2. The name of each LoRA weights indicates how many inference steps they should be applied.

  3. The name of each LoRA weights indicates whether they are able to use normal CFGs or small CFGs

    • NormalCFG means that model equipped with the LoRA can use CFG value 2-9 for generation. Yet you should adjust the CFG values given the steps you applied. When using fewer steps, you should use smaller CFGs. For example, use CFG 2.5 - 3.5 with 4 four steps and use CFG 3 - 6 with 8 steps. This is because that fewer-step means the model has fewer chance to fix the issues caused by the CFG.
    • SmallCFG means that the model equipped with the LoRA can use CFG value 1-2 for generation.

Note:

  • The normalCFG LoRAs are more sensitive to the prompts. Set proper positive and negative prompts for better quality.
  • Just find the normalCFG with 4-step is not working very well. Trying to solve the issue.

[paper] [arXiv] [code] [project page]