VideoBooth: Diffusion-based Video Generation with Image Prompts Paper • 2312.00777 • Published 4 days ago • 12
FreeNoise: Tuning-Free Longer Video Diffusion Via Noise Rescheduling Paper • 2310.15169 • Published Oct 23 • 7
HyperHuman: Hyper-Realistic Human Generation with Latent Structural Diffusion Paper • 2310.08579 • Published Oct 12 • 13
Octopus: Embodied Vision-Language Programmer from Environmental Feedback Paper • 2310.08588 • Published Oct 12 • 31
DreamGaussian: Generative Gaussian Splatting for Efficient 3D Content Creation Paper • 2309.16653 • Published Sep 28 • 35
LAVIE: High-Quality Video Generation with Cascaded Latent Diffusion Models Paper • 2309.15103 • Published Sep 26 • 40
MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation Paper • 2309.13042 • Published Sep 22 • 8
CityDreamer: Compositional Generative Model of Unbounded 3D Cities Paper • 2309.00610 • Published Sep 1 • 11
DNA-Rendering: A Diverse Neural Actor Repository for High-Fidelity Human-centric Rendering Paper • 2307.10173 • Published Jul 19 • 4
InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation Paper • 2307.06942 • Published Jul 13 • 18
Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation Paper • 2306.07954 • Published Jun 13 • 107
Otter: A Multi-Modal Model with In-Context Instruction Tuning Paper • 2305.03726 • Published May 5 • 3