AnimateLCM-SVD-xt / README.md
wangfuyun's picture
Update README.md
8969e35 verified
|
raw
history blame
1.88 kB
metadata
pipeline_tag: image-to-video

Samples generated by AnimateLCM-SVD-xt

Introduction

Consistency Distilled Stable Video Diffusion Image2Video-XT (SVD-xt) following the strategy proposed in AnimateLCM-paper. AnimateLCM-SVD-xt can generate good quality image-conditioned videos with 25 frames in 2~8 steps with 576x1024 resolutions.

Computation comparsion

AnimateLCM-SVD-xt can generally produces demos with good quality in 4 steps without requiring the classifier-free guidance, and therefore can save 25 x 2 / 4 = 12.5 times compuation resources compared with normal SVD models.

Demos

Alt text 1 Alt text 2 Alt text 3
2 steps, cfg=1 4 steps, cfg=1 8 steps, cfg=1
Alt text 1 Alt text 2 Alt text 3
2 steps, cfg=1 4 steps, cfg=1 8 steps, cfg=1
Alt text 1 Alt text 2 Alt text 3
2 steps, cfg=1 4 steps, cfg=1 8 steps, cfg=1
Alt text 1 Alt text 2 Alt text 3
2 steps, cfg=1 4 steps, cfg=1 8 steps, cfg=1
Alt text 1 Alt text 2 Alt text 3
2 steps, cfg=1 4 steps, cfg=1 8 steps, cfg=1

Please contact Fu-Yun Wang (fywang@link.cuhk.edu.hk) for the inference code and the scheduler design. I might respond a bit later. Thank you!