-
EdgeFusion: On-Device Text-to-Image Generation
Paper • 2404.11925 • Published • 19 -
Dynamic Typography: Bringing Words to Life
Paper • 2404.11614 • Published • 40 -
ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback
Paper • 2404.07987 • Published • 46 -
Applying Guidance in a Limited Interval Improves Sample and Distribution Quality in Diffusion Models
Paper • 2404.07724 • Published • 10
Collections
Discover the best community collections!
Collections including paper arxiv:2404.11614
-
AniClipart: Clipart Animation with Text-to-Video Priors
Paper • 2404.12347 • Published • 11 -
MoA: Mixture-of-Attention for Subject-Context Disentanglement in Personalized Image Generation
Paper • 2404.11565 • Published • 12 -
Dynamic Typography: Bringing Words to Life
Paper • 2404.11614 • Published • 40
-
Dynamic Typography: Bringing Words to Life
Paper • 2404.11614 • Published • 40 -
Scene Coordinate Reconstruction: Posing of Image Collections via Incremental Learning of a Relocalizer
Paper • 2404.14351 • Published • 5 -
BlenderAlchemy: Editing 3D Graphics with Vision-Language Models
Paper • 2404.17672 • Published • 18 -
Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation
Paper • 2406.06525 • Published • 60
-
Be Yourself: Bounded Attention for Multi-Subject Text-to-Image Generation
Paper • 2403.16990 • Published • 24 -
ViTAR: Vision Transformer with Any Resolution
Paper • 2403.18361 • Published • 48 -
Getting it Right: Improving Spatial Consistency in Text-to-Image Models
Paper • 2404.01197 • Published • 29 -
Bigger is not Always Better: Scaling Properties of Latent Diffusion Models
Paper • 2404.01367 • Published • 19