T2V-Turbo: Breaking the Quality Bottleneck of Video Consistency Model with Mixed Reward Feedback

4-step Text-to-video Generation

With the style of low-poly game art, A majestic, white horse gallops gracefully across a moonlit beach. medium shot of Christine, a beautiful 25-year-old brunette resembling Selena Gomez, anxiously looking up as she walks down a New York street, cinematic style a cartoon pig playing his guitar, Andrew Warhol style
a dog wearing vr goggles on a boat Pikachu snowboarding a girl floating underwater

Model description πŸš€

This repository contains unet_lora.pt that can turn VideoCrafter2 into our T2V-Turbo (VC2). Our T2V-Turbo (VC2) can achieve both fast and high-quality T2V generation. On VBench, the 4-step generation from our T2V-Turbo (VC2) even outperform proprietary systems, including Gen-2 and Pika. Please refer to our GitHub repo for detailed instructions.

Misuse, Malicious Use and Excessive Use πŸ“–

Our model is meant for research purposes.

  • It is prohibited to generate content that is demeaning or harmful to people or their environment, culture, religion, etc.
  • Prohibited for pornographic, violent and bloody content generation.
  • Prohibited for error and false information generation.
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference API
Unable to determine this model's library. Check the docs .

Spaces using jiachenli-ucsb/T2V-Turbo-VC2 2

Collection including jiachenli-ucsb/T2V-Turbo-VC2