No More Adam: Learning Rate Scaling at Initialization is All You Need • Paper • 2412.11768 • Published 5 days ago • 36
ColorFlow: Retrieval-Augmented Image Sequence Colorization • Paper • 2412.11815 • Published 5 days ago • 26
Apollo: An Exploration of Video Understanding in Large Multimodal Models • Paper • 2412.10360 • Published 8 days ago • 129
SnapGen: Taming High-Resolution Text-to-Image Models for Mobile Devices with Efficient Architectures and Training • Paper • 2412.09619 • Published 9 days ago • 20
SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints • Paper • 2412.07760 • Published 11 days ago • 49
Moto: Latent Motion Token as the Bridging Language for Robot Manipulation • Paper • 2412.04445 • Published 16 days ago • 21
Monet: Mixture of Monosemantic Experts for Transformers • Paper • 2412.04139 • Published 16 days ago • 10
4Real-Video: Learning Generalizable Photo-Realistic 4D Video Diffusion • Paper • 2412.04462 • Published 16 days ago • 7
MEMO: Memory-Guided Diffusion for Expressive Talking Video Generation • Paper • 2412.04448 • Published 16 days ago • 9
One Shot, One Talk: Whole-body Talking Avatar from a Single Image • Paper • 2412.01106 • Published 20 days ago • 18
SNOOPI: Supercharged One-step Diffusion Distillation with Proper Guidance • Paper • 2412.02687 • Published 18 days ago • 109
FAM Diffusion: Frequency and Attention Modulation for High-Resolution Image Generation with Stable Diffusion • Paper • 2411.18552 • Published 24 days ago • 17
Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model • Paper • 2411.19108 • Published 23 days ago • 17
Identity-Preserving Text-to-Video Generation by Frequency Decomposition • Paper • 2411.17440 • Published 25 days ago • 34
Make-It-Animatable: An Efficient Framework for Authoring Animation-Ready 3D Characters • Paper • 2411.18197 • Published 24 days ago • 14
MARVEL-40M+: Multi-Level Visual Elaboration for High-Fidelity Text-to-3D Content Creation • Paper • 2411.17945 • Published 25 days ago • 24
OminiControl: Minimal and Universal Control for Diffusion Transformer • Paper • 2411.15098 • Published 29 days ago • 53