RetroLLM: Empowering Large Language Models to Retrieve Fine-grained Evidence within Generation Paper • 2412.11919 • Published 7 days ago • 33
ColorFlow: Retrieval-Augmented Image Sequence Colorization Paper • 2412.11815 • Published 7 days ago • 26
Multi-Dimensional Insights: Benchmarking Real-World Personalization in Large Multimodal Models Paper • 2412.12606 • Published 6 days ago • 40
No More Adam: Learning Rate Scaling at Initialization is All You Need Paper • 2412.11768 • Published 7 days ago • 38
FlowEdit: Inversion-Free Text-Based Editing Using Pre-Trained Flow Models Paper • 2412.08629 • Published 11 days ago • 11
Mogo: RQ Hierarchical Causal Transformer for High-Quality 3D Human Motion Generation Paper • 2412.07797 • Published 18 days ago • 11
SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints Paper • 2412.07760 • Published 12 days ago • 49
Around the World in 80 Timesteps: A Generative Approach to Global Visual Geolocation Paper • 2412.06781 • Published 13 days ago • 18
UniReal: Universal Image Generation and Editing via Learning Real-world Dynamics Paper • 2412.07774 • Published 12 days ago • 24
FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models Paper • 2412.07674 • Published 13 days ago • 20
DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation Paper • 2412.07589 • Published 13 days ago • 45
NitroFusion: High-Fidelity Single-Step Diffusion through Dynamic Adversarial Training Paper • 2412.02030 • Published 20 days ago • 18
MIDI: Multi-Instance Diffusion for Single Image to 3D Scene Generation Paper • 2412.03558 • Published 18 days ago • 14
Mimir: Improving Video Diffusion Models for Precise Text Understanding Paper • 2412.03085 • Published 19 days ago • 12
LumiNet: Latent Intrinsics Meets Diffusion Models for Indoor Scene Relighting Paper • 2412.00177 • Published 23 days ago • 7
One Shot, One Talk: Whole-body Talking Avatar from a Single Image Paper • 2412.01106 • Published 21 days ago • 18
PaliGemma 2: A Family of Versatile VLMs for Transfer Paper • 2412.03555 • Published 18 days ago • 118
SNOOPI: Supercharged One-step Diffusion Distillation with Proper Guidance Paper • 2412.02687 • Published 19 days ago • 109