OmniVideo-100K: A Dataset for Audio-Visual Reasoning through Structured Scripts and Evidence Chains Paper • 2606.14702 • Published 4 days ago • 24
Redesign Mixture-of-Experts Routers with Manifold Power Iteration Paper • 2606.12397 • Published 6 days ago • 85
SANA-Streaming: Real-time Streaming Video Editing with Hybrid Diffusion Transformer Paper • 2605.30409 • Published 19 days ago • 38
SwanVoice: Expressive Long-Form Zero-Shot Speech Synthesis for Both Monologue and Dialogue Paper • 2605.30993 • Published 18 days ago • 58