Nested Attention: Semantic-aware Attention Values for Concept Personalization Paper • 2501.01407 • Published 3 days ago • 7
SeedVR: Seeding Infinity in Diffusion Transformer Towards Generic Video Restoration Paper • 2501.01320 • Published 3 days ago • 8
VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM Paper • 2501.00599 • Published 5 days ago • 31
Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models Paper • 2501.01423 • Published 3 days ago • 30
VideoAnydoor: High-fidelity Video Object Insertion with Precise Motion Control Paper • 2501.01427 • Published 3 days ago • 39
sayakpaul/cartoon-control-lr_1e-4-wd_1e-4-gs_10.0-cd_0.1 Text-to-Image • Updated 19 minutes ago • 6 • 3
cyberagent/opencole-typographylmm-llava-v1.5-7b-lora Image-Text-to-Text • Updated May 9, 2024 • 27 • 6
From Elements to Design: A Layered Approach for Automatic Graphic Design Composition Paper • 2412.19712 • Published 9 days ago • 14
Orient Anything: Learning Robust Object Orientation Estimation from Rendering 3D Models Paper • 2412.18605 • Published 12 days ago • 17
SpotLight: Shadow-Guided Object Relighting via Diffusion Paper • 2411.18665 • Published Nov 27, 2024 • 3
MotiF: Making Text Count in Image Animation with Motion Focal Loss Paper • 2412.16153 • Published 16 days ago • 6
Flash Diffusion: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation Paper • 2406.02347 • Published Jun 4, 2024 • 2