KumoVideo: Video Generation with Mixed Attention Transformer
KumoVideo is an open-source video generation framework based on Diffusion Transformer (DiT). We introduce MADiT (Mix Attention DiT), an advanced architecture that incorporates mix attention mechanisms to further elevate video generation quality and efficiency. KumoVideo demonstrates exceptional capability in producing high-fidelity, visually engaging, and temporally coherent videos, making it highly versatile for diverse applications across various domains. This repository empowers users to perform inference on KumoVideo using consumer-grade GPUs, democratizing access to cutting-edge video generation technology. We are honored to have open-sourced KumoVideo for the benefit of the community and remain dedicated to continuously refining and advancing the project to address the evolving demands of users and developers. For more details, please refer to link.
- Downloads last month
- 0