
alibaba-pai/Wan2.1-Fun-1.3B-Control
Text-to-Video
•
Updated
•
22.2k
•
95
A unified multimodal understanding and generation model.
Generate images based on text prompts and condition images
High-quality speech synthesis powered by Kokoro TTS
Find matching images from a collection