Running 275 275 Qwen2.5 Omni 7B Demo 🏆 Generate text and speech responses from text, images, or audio input
Video-Guided Foley Sound Generation with Multimodal Controls Paper • 2411.17698 • Published Nov 26, 2024 • 10
MoCha: Towards Movie-Grade Talking Character Synthesis Paper • 2503.23307 • Published 29 days ago • 131
Running 542 542 Kolors Portrait With Flux 🤗 Kolors Portrait to keep face identity developed with Flux