FastHunyuan / README.md
PY007's picture
Upload folder using huggingface_hub
2c110fc verified
|
raw
history blame
1.93 kB
metadata
pipeline_tag: text-to-video
license: other
license_name: tencent-hunyuan-community
license_link: LICENSE

FastHunyuan Model Card

Model Details

FastHunyuan is an accelerated HunyuanVideo model. It can sample high quality videos with 6 diffusion steps. That brings around 8X speed up compared to the original HunyuanVideo with 50 steps.

Usage

  • Clone Fastvideo repository and follow the inference instructions in the README.
  • Alternatively, you can inference FastHunyuan using the official Hunyuan Video repository by setting the shift to 17 and steps to 6.

Training details

FastHunyuan is consistency distillated on the MixKit dataset with the following hyperparamters:

  • Batch size: 16
  • Resulotion: 720x1280
  • Num of frames: 125
  • Train steps: 320
  • GPUs: 32
  • LR: 1e-6
  • Loss: huber

Evaluation

We provide some qualitative comparison between FastHunyuan 6 step inference v.s. the original Hunyuan with 6 step inference:

FastHunyuan 6 step Hunyuan 6 step
FastHunyuan 6 step Hunyuan 6 step
FastHunyuan 6 step Hunyuan 6 step
FastHunyuan 6 step Hunyuan 6 step
FastHunyuan 6 step Hunyuan 6 step