---
pipeline_tag: text-to-video
license: other
license_name: tencent-hunyuan-community
license_link: LICENSE
---
<p align="center">
<img src="assets/logo.jpg" height=100>
</p>
# FastHunyuan Model Card
## Model Details
FastHunyuan is an accelerated [HunyuanVideo](https://huggingface.co/tencent/HunyuanVideo) model. It can sample high-quality videos in 6 diffusion steps, which gives roughly an 8x speedup over the original HunyuanVideo running 50 steps.
- **Developed by** [Hao AI Lab](https://hao-ai-lab.github.io/)
- **License** tencent-hunyuan-community
- **Distilled from** [HunyuanVideo](https://huggingface.co/tencent/HunyuanVideo)
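
The "around 8X" figure follows from the ratio of sampling steps. A minimal sketch of that arithmetic, assuming per-step cost is the same for both models (the function name `step_ratio` is hypothetical, and measured wall-clock speedup also depends on fixed costs such as text encoding and VAE decoding):

```python
def step_ratio(baseline_steps: int, distilled_steps: int) -> float:
    """Ratio of denoising steps between a baseline and a distilled sampler."""
    return baseline_steps / distilled_steps


# 50 steps for the original HunyuanVideo vs. 6 steps for FastHunyuan.
ratio = step_ratio(50, 6)
print(f"{ratio:.2f}x fewer denoising steps")
```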
## [GitHub Repository](https://github.com/hao-ai-lab/FastVideo)
## Usage
- Clone the [FastVideo](https://github.com/hao-ai-lab/FastVideo) repository and follow the inference instructions in its README.
- Alternatively, you can run inference with FastHunyuan using the official [HunyuanVideo repository](https://github.com/Tencent/HunyuanVideo) by **setting the shift to 17 and steps to 6**.
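
To give a feel for what a shift of 17 does to a 6-step schedule, here is a minimal sketch. It assumes the time-shift map commonly used by flow-matching samplers, `shift * s / (1 + (shift - 1) * s)`, applied to linearly spaced noise levels; the helper name `shifted_sigmas` is hypothetical, and the exact schedule in the HunyuanVideo code may differ in details.

```python
def shifted_sigmas(num_steps: int, shift: float) -> list[float]:
    """Linearly spaced noise levels from 1 to 0, warped by a time shift.

    Assumption: the warp is shift * s / (1 + (shift - 1) * s).
    """
    sigmas = [1.0 - i / num_steps for i in range(num_steps + 1)]
    return [shift * s / (1.0 + (shift - 1.0) * s) for s in sigmas]


# The settings recommended above: 6 steps, shift 17.
schedule = shifted_sigmas(num_steps=6, shift=17.0)
for i, sigma in enumerate(schedule):
    print(i, round(sigma, 4))
```

Under this assumption, a large shift keeps most of the intermediate noise levels close to 1, so the few available steps are concentrated at the high-noise end of the trajectory.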
## Training details
FastHunyuan is consistency-distilled on the [MixKit](https://huggingface.co/datasets/LanguageBind/Open-Sora-Plan-v1.1.0/tree/main) dataset with the following hyperparameters:
- Batch size: 16
- Resolution: 720x1280
- Number of frames: 125
- Train steps: 320
- GPUs: 32
- LR: 1e-6
- Loss: Huber
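
For readers unfamiliar with the Huber loss listed above, the following is a minimal pure-Python sketch of the standard definition. The threshold `delta=1.0` and the mean reduction are assumptions for illustration; the values used in training are not stated here.

```python
def huber_loss(pred: list[float], target: list[float], delta: float = 1.0) -> float:
    """Mean Huber loss: quadratic for small errors, linear for large ones.

    delta (assumed 1.0 here) is the error size where the loss switches form.
    """
    total = 0.0
    for p, t in zip(pred, target):
        err = abs(p - t)
        if err <= delta:
            total += 0.5 * err * err            # quadratic region
        else:
            total += delta * (err - 0.5 * delta)  # linear region
    return total / len(pred)


# One small error (0.5) and one large error (3.0).
loss = huber_loss([0.0, 3.0], [0.5, 0.0])
print(loss)
```

Compared with a plain squared error, the linear region limits the influence of large residuals on the gradient.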
## Evaluation
We provide qualitative comparisons between FastHunyuan with 6-step inference and the original HunyuanVideo with 6-step inference: