Solaren
's Collections
FreeU: Free Lunch in Diffusion U-Net
Paper
•
2309.11497
•
Published
•
64
Imagic: Text-Based Real Image Editing with Diffusion Models
Paper
•
2210.09276
•
Published
On Architectural Compression of Text-to-Image Diffusion Models
Paper
•
2305.15798
•
Published
•
4
Wuerstchen: Efficient Pretraining of Text-to-Image Models
Paper
•
2306.00637
•
Published
•
12
CLIP-KD: An Empirical Study of Distilling CLIP Models
Paper
•
2307.12732
•
Published
Online Clustered Codebook
Paper
•
2307.15139
•
Published
•
1
Residual Denoising Diffusion Models
Paper
•
2308.13712
•
Published
•
2
InstaFlow: One Step is Enough for High-Quality Diffusion-Based
Text-to-Image Generation
Paper
•
2309.06380
•
Published
•
32
Restart Sampling for Improving Generative Processes
Paper
•
2306.14878
•
Published
•
5
Controlling Text-to-Image Diffusion by Orthogonal Finetuning
Paper
•
2306.07280
•
Published
•
20
Emu: Enhancing Image Generation Models Using Photogenic Needles in a
Haystack
Paper
•
2309.15807
•
Published
•
32
Finite Scalar Quantization: VQ-VAE Made Simple
Paper
•
2309.15505
•
Published
•
21
Muse: Text-To-Image Generation via Masked Generative Transformers
Paper
•
2301.00704
•
Published
PixArt-α: Fast Training of Diffusion Transformer for
Photorealistic Text-to-Image Synthesis
Paper
•
2310.00426
•
Published
•
61
DPM-Solver-v3: Improved Diffusion ODE Solver with Empirical Model
Statistics
Paper
•
2310.13268
•
Published
•
17
CommonCanvas: An Open Diffusion Model Trained with Creative-Commons
Images
Paper
•
2310.16825
•
Published
•
32
Image Super-resolution Via Latent Diffusion: A Sampling-space Mixture Of
Experts And Frequency-augmented Decoder Approach
Paper
•
2310.12004
•
Published
•
2
SAM-CLIP: Merging Vision Foundation Models towards Semantic and Spatial
Understanding
Paper
•
2310.15308
•
Published
•
22
Matryoshka Diffusion Models
Paper
•
2310.15111
•
Published
•
41
Beyond U: Making Diffusion Models Faster & Lighter
Paper
•
2310.20092
•
Published
•
11
I2VGen-XL: High-Quality Image-to-Video Synthesis via Cascaded Diffusion
Models
Paper
•
2311.04145
•
Published
•
32
Soft Mixture Denoising: Beyond the Expressive Bottleneck of Diffusion
Models
Paper
•
2309.14068
•
Published
•
1
Denoising Diffusion Step-aware Models
Paper
•
2310.03337
•
Published
•
1
DiffNAS: Bootstrapping Diffusion Models by Prompting for Better
Architectures
Paper
•
2310.04750
•
Published
•
1
PIXART-δ: Fast and Controllable Image Generation with Latent
Consistency Models
Paper
•
2401.05252
•
Published
•
47
Mastering Text-to-Image Diffusion: Recaptioning, Planning, and
Generating with Multimodal LLMs
Paper
•
2401.11708
•
Published
•
30
UNIMO-G: Unified Image Generation through Multimodal Conditional
Diffusion
Paper
•
2401.13388
•
Published
•
11
Deconstructing Denoising Diffusion Models for Self-Supervised Learning
Paper
•
2401.14404
•
Published
•
17
CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept
Matching
Paper
•
2404.03653
•
Published
•
33