-
EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
Paper โข 2402.17485 โข Published โข 184 -
VividTalk: One-Shot Audio-Driven Talking Head Generation Based on 3D Hybrid Prior
Paper โข 2312.01841 โข Published โข 1 -
MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
Paper โข 2311.16498 โข Published โข 1 -
GaussianAvatar: Towards Realistic Human Avatar Modeling from a Single Video via Animatable 3D Gaussians
Paper โข 2312.02134 โข Published โข 2
Collections
Discover the best community collections!
Collections including paper arxiv:2404.16771
-
LCM-Lookahead for Encoder-based Text-to-Image Personalization
Paper โข 2404.03620 โข Published -
PuLID: Pure and Lightning ID Customization via Contrastive Alignment
Paper โข 2404.16022 โข Published โข 16 -
ID-Aligner: Enhancing Identity-Preserving Text-to-Image Generation with Reward Feedback Learning
Paper โข 2404.15449 โข Published โข 11 -
ConsistentID: Portrait Generation with Multimodal Fine-Grained Identity Preserving
Paper โข 2404.16771 โข Published โข 16
-
EdgeFusion: On-Device Text-to-Image Generation
Paper โข 2404.11925 โข Published โข 20 -
Dynamic Typography: Bringing Words to Life
Paper โข 2404.11614 โข Published โข 40 -
ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback
Paper โข 2404.07987 โข Published โข 46 -
Applying Guidance in a Limited Interval Improves Sample and Distribution Quality in Diffusion Models
Paper โข 2404.07724 โข Published โข 10
-
ฮป-ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion Models by Leveraging CLIP Latent Space
Paper โข 2402.05195 โข Published โข 16 -
DreamMatcher: Appearance Matching Self-Attention for Semantically-Consistent Text-to-Image Personalization
Paper โข 2402.09812 โข Published โข 11 -
The Chosen One: Consistent Characters in Text-to-Image Diffusion Models
Paper โข 2311.10093 โข Published โข 55 -
IDAdapter: Learning Mixed Features for Tuning-Free Personalization of Text-to-Image Models
Paper โข 2403.13535 โข Published โข 20
-
Compose and Conquer: Diffusion-Based 3D Depth Aware Composable Image Synthesis
Paper โข 2401.09048 โข Published โข 7 -
Improving fine-grained understanding in image-text pre-training
Paper โข 2401.09865 โข Published โข 14 -
Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data
Paper โข 2401.10891 โข Published โข 54 -
Scaling Up to Excellence: Practicing Model Scaling for Photo-Realistic Image Restoration In the Wild
Paper โข 2401.13627 โข Published โข 70
-
One-for-All: Generalized LoRA for Parameter-Efficient Fine-tuning
Paper โข 2306.07967 โข Published โข 24 -
Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation
Paper โข 2306.07954 โข Published โข 111 -
TryOnDiffusion: A Tale of Two UNets
Paper โข 2306.08276 โข Published โข 71 -
Seeing the World through Your Eyes
Paper โข 2306.09348 โข Published โข 31
-
VideoBooth: Diffusion-based Video Generation with Image Prompts
Paper โข 2312.00777 โข Published โข 19 -
MotionCtrl: A Unified and Flexible Motion Controller for Video Generation
Paper โข 2312.03641 โข Published โข 19 -
GenTron: Delving Deep into Diffusion Transformers for Image and Video Generation
Paper โข 2312.04557 โข Published โข 12 -
DreamVideo: Composing Your Dream Videos with Customized Subject and Motion
Paper โข 2312.04433 โข Published โข 9