A Recipe for Scaling up Text-to-Video Generation with Text-free Videos Paper • 2312.15770 • Published Dec 25, 2023 • 12
HarmonyView: Harmonizing Consistency and Diversity in One-Image-to-3D Paper • 2312.15980 • Published Dec 26, 2023 • 10
MaskINT: Video Editing via Interpolative Non-autoregressive Masked Transformers Paper • 2312.12468 • Published Dec 19, 2023 • 7
ShowRoom3D: Text to High-Quality 3D Room Generation Using 3D Priors Paper • 2312.13324 • Published Dec 20, 2023 • 8
Align Your Gaussians: Text-to-4D with Dynamic 3D Gaussians and Composed Diffusion Models Paper • 2312.13763 • Published Dec 21, 2023 • 9
Fairy: Fast Parallelized Instruction-Guided Video-to-Video Synthesis Paper • 2312.13834 • Published Dec 20, 2023 • 25
VideoPoet: A Large Language Model for Zero-Shot Video Generation Paper • 2312.14125 • Published Dec 21, 2023 • 41
VideoRF: Rendering Dynamic Radiance Fields as 2D Feature Video Streams Paper • 2312.01407 • Published Dec 3, 2023 • 6
Music ControlNet: Multiple Time-varying Controls for Music Generation Paper • 2311.07069 • Published Nov 13, 2023 • 43
HierSpeech++: Bridging the Gap between Semantic and Acoustic Representation of Speech by Hierarchical Variational Inference for Zero-shot Speech Synthesis Paper • 2311.12454 • Published Nov 21, 2023 • 27
The Chosen One: Consistent Characters in Text-to-Image Diffusion Models Paper • 2311.10093 • Published Nov 16, 2023 • 54
Emu Video: Factorizing Text-to-Video Generation by Explicit Image Conditioning Paper • 2311.10709 • Published Nov 17, 2023 • 24
MagicDance: Realistic Human Dance Video Generation with Motions & Facial Expressions Transfer Paper • 2311.12052 • Published Nov 18, 2023 • 28
VMC: Video Motion Customization using Temporal Attention Adaption for Text-to-Video Diffusion Models Paper • 2312.00845 • Published Dec 1, 2023 • 36
LAMP: Learn A Motion Pattern for Few-Shot-Based Video Generation Paper • 2310.10769 • Published Oct 16, 2023 • 8
Ground-A-Video: Zero-shot Grounded Video Editing using Text-to-image Diffusion Models Paper • 2310.01107 • Published Oct 2, 2023 • 4
VideoDirectorGPT: Consistent Multi-scene Video Generation via LLM-Guided Planning Paper • 2309.15091 • Published Sep 26, 2023 • 31
Grounded Text-to-Image Synthesis with Attention Refocusing Paper • 2306.05427 • Published Jun 8, 2023 • 2
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis Paper • 2310.00426 • Published Sep 30, 2023 • 60
Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation Paper • 2309.15818 • Published Sep 27, 2023 • 18