Long-Context Autoregressive Video Modeling with Next-Frame Prediction • Paper • 2503.19325 • Published 30 days ago • 72
Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model • Paper • 2502.10248 • Published Feb 14 • 56
Pixel-Space Post-Training of Latent Diffusion Models • Paper • 2409.17565 • Published Sep 26, 2024 • 22
Imagine yourself: Tuning-Free Personalized Image Generation • Paper • 2409.13346 • Published Sep 20, 2024 • 70
MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions • Paper • 2407.06358 • Published Jul 8, 2024 • 19
MotionClone: Training-Free Motion Cloning for Controllable Video Generation • Paper • 2406.05338 • Published Jun 8, 2024 • 42
Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation • Paper • 2406.06525 • Published Jun 10, 2024 • 71
Improved Distribution Matching Distillation for Fast Image Synthesis • Paper • 2405.14867 • Published May 23, 2024 • 16