ShineChen1024's Collections
diffusion
Style Aligned Image Generation via Shared Attention
Paper
•
2312.02133
•
Published
•
8
FaceStudio: Put Your Face Everywhere in Seconds
Paper
•
2312.02663
•
Published
•
27
X-Adapter: Adding Universal Compatibility of Plugins for Upgraded Diffusion Model
Paper
•
2312.02238
•
Published
•
24
Orthogonal Adaptation for Modular Customization of Diffusion Models
Paper
•
2312.02432
•
Published
•
12
PhotoMaker: Customizing Realistic Human Photos via Stacked ID Embedding
Paper
•
2312.04461
•
Published
•
48
Smooth Diffusion: Crafting Smooth Latent Spaces in Diffusion Models
Paper
•
2312.04410
•
Published
•
14
CCM: Adding Conditional Controls to Text-to-Image Consistency Models
Paper
•
2312.06971
•
Published
•
10
FreeControl: Training-Free Spatial Control of Any Text-to-Image Diffusion Model with Any Condition
Paper
•
2312.07536
•
Published
•
15
DiffMorpher: Unleashing the Capability of Diffusion Models for Image Morphing
Paper
•
2312.07409
•
Published
•
20
Your Student is Better Than Expected: Adaptive Teacher-Student Collaboration for Text-Conditional Diffusion Models
Paper
•
2312.10835
•
Published
•
5
HD-Painter: High-Resolution and Prompt-Faithful Text-Guided Image Inpainting with Diffusion Models
Paper
•
2312.14091
•
Published
•
13
DreamDistribution: Prompt Distribution Learning for Text-to-Image Diffusion Models
Paper
•
2312.14216
•
Published
•
10
One-dimensional Adapter to Rule Them All: Concepts, Diffusion Models and Erasing Applications
Paper
•
2312.16145
•
Published
•
8
Prompt Expansion for Adaptive Text-to-Image Generation
Paper
•
2312.16720
•
Published
•
4
SSR-Encoder: Encoding Selective Subject Representation for Subject-Driven Generation
Paper
•
2312.16272
•
Published
•
5
Instruct-Imagen: Image Generation with Multi-modal Instruction
Paper
•
2401.01952
•
Published
•
29
Improving Diffusion-Based Image Synthesis with Context Prediction
Paper
•
2401.02015
•
Published
•
6
Score Distillation Sampling with Learned Manifold Corrective
Paper
•
2401.05293
•
Published
•
6
PIXART-δ: Fast and Controllable Image Generation with Latent Consistency Models
Paper
•
2401.05252
•
Published
•
43
PALP: Prompt Aligned Personalization of Text-to-Image Models
Paper
•
2401.06105
•
Published
•
46
SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers
Paper
•
2401.08740
•
Published
•
10
Synthesizing Moving People with 3D Control
Paper
•
2401.10889
•
Published
•
11
Scalable High-Resolution Pixel-Space Image Synthesis with Hourglass Diffusion Transformers
Paper
•
2401.11605
•
Published
•
19
Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs
Paper
•
2401.11708
•
Published
•
27
BootPIG: Bootstrapping Zero-shot Personalized Image Generation Capabilities in Pretrained Diffusion Models
Paper
•
2401.13974
•
Published
•
11
CreativeSynth: Creative Blending and Synthesis of Visual Arts based on Multimodal Diffusion
Paper
•
2401.14066
•
Published
•
7
UNIMO-G: Unified Image Generation through Multimodal Conditional Diffusion
Paper
•
2401.13388
•
Published
•
9
Divide and Conquer: Language Models can Plan and Self-Correct for Compositional Text-to-Image Generation
Paper
•
2401.15688
•
Published
•
10
Object-Driven One-Shot Fine-tuning of Text-to-Image Diffusion with Prototypical Embedding
Paper
•
2401.15708
•
Published
•
9
StableIdentity: Inserting Anybody into Anywhere at First Sight
Paper
•
2401.15975
•
Published
•
16
λ-ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion Models by Leveraging CLIP Latent Space
Paper
•
2402.05195
•
Published
•
16
Animated Stickers: Bringing Stickers to Life with Video Diffusion
Paper
•
2402.06088
•
Published
•
9
DreamMatcher: Appearance Matching Self-Attention for Semantically-Consistent Text-to-Image Personalization
Paper
•
2402.09812
•
Published
•
11
Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation
Paper
•
2402.10210
•
Published
•
28
PRDP: Proximal Reward Difference Prediction for Large-Scale Reward Finetuning of Diffusion Models
Paper
•
2402.08714
•
Published
•
10
Make a Cheap Scaling: A Self-Cascade Diffusion Model for Higher-Resolution Adaptation
Paper
•
2402.10491
•
Published
•
15
DiLightNet: Fine-grained Lighting Control for Diffusion-based Image Generation
Paper
•
2402.11929
•
Published
•
9
FiT: Flexible Vision Transformer for Diffusion Model
Paper
•
2402.12376
•
Published
•
46
RealCompo: Dynamic Equilibrium between Realism and Compositionality Improves Text-to-Image Diffusion Models
Paper
•
2402.12908
•
Published
•
5
D-Flow: Differentiating through Flows for Controlled Generation
Paper
•
2402.14017
•
Published
•
5
Multi-LoRA Composition for Image Generation
Paper
•
2402.16843
•
Published
•
28
DiffuseKronA: A Parameter Efficient Fine-tuning Method for Personalized Diffusion Model
Paper
•
2402.17412
•
Published
•
21
RealCustom: Narrowing Real Text Word for Real-Time Open-Domain Text-to-Image Customization
Paper
•
2403.00483
•
Published
•
8
ResAdapter: Domain Consistent Resolution Adapter for Diffusion Models
Paper
•
2403.02084
•
Published
•
11
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
Paper
•
2403.04692
•
Published
•
35
StableDrag: Stable Dragging for Point-based Image Editing
Paper
•
2403.04437
•
Published
•
23
DragAnything: Motion Control for Anything using Entity Representation
Paper
•
2403.07420
•
Published
•
11
Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering
Paper
•
2403.09622
•
Published
•
10
StreamMultiDiffusion: Real-Time Interactive Generation with Region-Based Semantic Control
Paper
•
2403.09055
•
Published
•
23
Infinite-ID: Identity-preserved Personalization via ID-semantics Decoupling Paradigm
Paper
•
2403.11781
•
Published
•
17
FouriScale: A Frequency Perspective on Training-Free High-Resolution Image Synthesis
Paper
•
2403.12963
•
Published
•
6
IDAdapter: Learning Mixed Features for Tuning-Free Personalization of Text-to-Image Models
Paper
•
2403.13535
•
Published
•
20
ReNoise: Real Image Inversion Through Iterative Noising
Paper
•
2403.14602
•
Published
•
19
TextCraftor: Your Text Encoder Can be Image Quality Controller
Paper
•
2403.18978
•
Published
•
12
CosmicMan: A Text-to-Image Foundation Model for Humans
Paper
•
2404.01294
•
Published
•
15
Getting it Right: Improving Spatial Consistency in Text-to-Image Models
Paper
•
2404.01197
•
Published
•
29
CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching
Paper
•
2404.03653
•
Published
•
28
Cross-Attention Makes Inference Cumbersome in Text-to-Image Diffusion Models
Paper
•
2404.02747
•
Published
•
11
InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation
Paper
•
2404.02733
•
Published
•
19
UniFL: Improve Stable Diffusion via Unified Feedback Learning
Paper
•
2404.05595
•
Published
•
20
ByteEdit: Boost, Comply and Accelerate Generative Image Editing
Paper
•
2404.04860
•
Published
•
24
SwapAnything: Enabling Arbitrary Object Swapping in Personalized Visual Editing
Paper
•
2404.05717
•
Published
•
23
BeyondScene: Higher-Resolution Human-Centric Scene Generation With Pretrained Diffusion
Paper
•
2404.04544
•
Published
•
20
MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation
Paper
•
2404.05674
•
Published
•
9
Aligning Diffusion Models by Optimizing Human Utility
Paper
•
2404.04465
•
Published
•
12
Diffusion-RWKV: Scaling RWKV-Like Architectures for Diffusion Models
Paper
•
2404.04478
•
Published
•
11
ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback
Paper
•
2404.07987
•
Published
•
45
Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model
Paper
•
2404.09967
•
Published
•
20
MoA: Mixture-of-Attention for Subject-Context Disentanglement in Personalized Image Generation
Paper
•
2404.11565
•
Published
•
12
Align Your Steps: Optimizing Sampling Schedules in Diffusion Models
Paper
•
2404.14507
•
Published
•
21
ID-Aligner: Enhancing Identity-Preserving Text-to-Image Generation with Reward Feedback Learning
Paper
•
2404.15449
•
Published
•
11
PuLID: Pure and Lightning ID Customization via Contrastive Alignment
Paper
•
2404.16022
•
Published
•
16
Editable Image Elements for Controllable Synthesis
Paper
•
2404.16029
•
Published
•
9
ConsistentID: Portrait Generation with Multimodal Fine-Grained Identity Preserving
Paper
•
2404.16771
•
Published
•
16
InstantFamily: Masked Attention for Zero-shot Multi-ID Image Generation
Paper
•
2404.19427
•
Published
•
60
StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation
Paper
•
2405.01434
•
Published
•
34