Video-Guided Foley Sound Generation with Multimodal Controls (arXiv:2411.17698, published 29 days ago)
FLOAT: Generative Motion Latent Flow Matching for Audio-driven Talking Portrait (arXiv:2412.01064, published 24 days ago)
OmniFlow: Any-to-Any Generation with Multi-Modal Rectified Flows (arXiv:2412.01169, published 24 days ago)
Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition (arXiv:2412.09501, published 14 days ago)