gary109
's Collections
Diffusion Model
updated
InstructDiffusion: A Generalist Modeling Interface for Vision Tasks
Paper
•
2309.03895
•
Published
•
11
ConceptGraphs: Open-Vocabulary 3D Scene Graphs for Perception and
Planning
Paper
•
2309.16650
•
Published
•
7
CCEdit: Creative and Controllable Video Editing via Diffusion Models
Paper
•
2309.16496
•
Published
•
7
FreeNoise: Tuning-Free Longer Video Diffusion Via Noise Rescheduling
Paper
•
2310.15169
•
Published
•
8
Wonder3D: Single Image to 3D using Cross-Domain Diffusion
Paper
•
2310.15008
•
Published
•
19
Matryoshka Diffusion Models
Paper
•
2310.15111
•
Published
•
39
TexFusion: Synthesizing 3D Textures with Text-Guided Image Diffusion
Models
Paper
•
2310.13772
•
Published
•
5
HyperFields: Towards Zero-Shot Generation of NeRFs from Text
Paper
•
2310.17075
•
Published
•
13
SE(3) Diffusion Model-based Point Cloud Registration for Robust 6D
Object Pose Estimation
Paper
•
2310.17359
•
Published
•
1
CodeFusion: A Pre-trained Diffusion Model for Code Generation
Paper
•
2310.17680
•
Published
•
68
CustomNet: Zero-shot Object Customization with Variable-Viewpoints in
Text-to-Image Diffusion Models
Paper
•
2310.19784
•
Published
•
9
SEINE: Short-to-Long Video Diffusion Model for Generative Transition and
Prediction
Paper
•
2310.20700
•
Published
•
8
Beyond U: Making Diffusion Models Faster & Lighter
Paper
•
2310.20092
•
Published
•
11
Controllable Music Production with Diffusion Models and Guidance
Gradients
Paper
•
2311.00613
•
Published
•
23
De-Diffusion Makes Text a Strong Cross-Modal Interface
Paper
•
2311.00618
•
Published
•
21
E3 TTS: Easy End-to-End Diffusion-based Text to Speech
Paper
•
2311.00945
•
Published
•
11
The Chosen One: Consistent Characters in Text-to-Image Diffusion Models
Paper
•
2311.10093
•
Published
•
54
UFOGen: You Forward Once Large Scale Text-to-Image Generation via
Diffusion GANs
Paper
•
2311.09257
•
Published
•
43
MagicDance: Realistic Human Dance Video Generation with Motions & Facial
Expressions Transfer
Paper
•
2311.12052
•
Published
•
28
Diffusion Model Alignment Using Direct Preference Optimization
Paper
•
2311.12908
•
Published
•
46
AnimateZero: Video Diffusion Models are Zero-Shot Image Animators
Paper
•
2312.03793
•
Published
•
17
Schrodinger Bridges Beat Diffusion Models on Text-to-Speech Synthesis
Paper
•
2312.03491
•
Published
•
34
StreamDiffusion: A Pipeline-level Solution for Real-time Interactive
Generation
Paper
•
2312.12491
•
Published
•
66
Zero-Shot Metric Depth with a Field-of-View Conditioned Diffusion Model
Paper
•
2312.13252
•
Published
•
25
InstructVideo: Instructing Video Diffusion Models with Human Feedback
Paper
•
2312.12490
•
Published
•
14
DREAM-Talk: Diffusion-based Realistic Emotional Audio-driven Method for
Single Image Talking Face Generation
Paper
•
2312.13578
•
Published
•
23
AnimateLCM: Accelerating the Animation of Personalized Diffusion Models
and Adapters with Decoupled Consistency Learning
Paper
•
2402.00769
•
Published
•
17
Magic-Me: Identity-Specific Video Customized Diffusion
Paper
•
2402.09368
•
Published
•
24
MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models
Paper
•
2402.06178
•
Published
•
12
FiT: Flexible Vision Transformer for Diffusion Model
Paper
•
2402.12376
•
Published
•
46
Music Style Transfer with Time-Varying Inversion of Diffusion Models
Paper
•
2402.13763
•
Published
•
9
EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with
Audio2Video Diffusion Model under Weak Conditions
Paper
•
2402.17485
•
Published
•
182
Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion
Latent Aligners
Paper
•
2402.17723
•
Published
•
15
Scalable Diffusion Models with Transformers
Paper
•
2212.09748
•
Published
•
8
Paper
•
2403.03954
•
Published
•
10
ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment
Paper
•
2403.05135
•
Published
•
39
StreamMultiDiffusion: Real-Time Interactive Generation with Region-Based
Semantic Control
Paper
•
2403.09055
•
Published
•
23