SkyReels-A2: Compose Anything in Video Diffusion Transformers Paper • 2504.02436 • Published 4 days ago • 25
Any2Caption:Interpreting Any Condition to Caption for Controllable Video Generation Paper • 2503.24379 • Published 7 days ago • 68
DreamActor-M1: Holistic, Expressive and Robust Human Image Animation with Hybrid Guidance Paper • 2504.01724 • Published 5 days ago • 57
GeometryCrafter: Consistent Geometry Estimation for Open-world Videos with Diffusion Priors Paper • 2504.01016 • Published 6 days ago • 27
ManipTrans: Efficient Dexterous Bimanual Manipulation Transfer via Residual Learning Paper • 2503.21860 • Published 11 days ago • 3
MoCha: Towards Movie-Grade Talking Character Synthesis Paper • 2503.23307 • Published 9 days ago • 98
TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenes Paper • 2503.23461 • Published 8 days ago • 87
Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models Paper • 2503.09573 • Published 26 days ago • 68
Free4D: Tuning-free 4D Scene Generation with Spatial-Temporal Consistency Paper • 2503.20785 • Published 12 days ago • 20
DiffPortrait360: Consistent Portrait Diffusion for 360 View Synthesis Paper • 2503.15667 • Published 19 days ago • 8
VideoMind: A Chain-of-LoRA Agent for Long Video Reasoning Paper • 2503.13444 • Published 21 days ago • 15
Feature4X: Bridging Any Monocular Video to 4D Agentic AI with Versatile Gaussian Feature Fields Paper • 2503.20776 • Published 12 days ago • 8