Any2Caption: Interpreting Any Condition to Caption for Controllable Video Generation Paper • 2503.24379 • Published 6 days ago • 68
Lumina-T2X: Transforming Text into Any Modality, Resolution, and Duration via Flow-based Large Diffusion Transformers Paper • 2405.05945 • Published May 9, 2024 • 3
IntrinsicNeRF: Learning Intrinsic Neural Radiance Fields for Editable Novel View Synthesis Paper • 2210.00647 • Published Oct 2, 2022 • 1
FullDiT: Multi-Task Video Generative Foundation Model with Full Attention Paper • 2503.19907 • Published 12 days ago • 8
SketchVideo: Sketch-based Video Generation and Editing Paper • 2503.23284 • Published 7 days ago • 20