VMC: Video Motion Customization using Temporal Attention Adaption for Text-to-Video Diffusion Models
Abstract
Text-to-video diffusion models have advanced video generation significantly. However, customizing these models to generate videos with tailored motions presents a substantial challenge. In specific, they encounter hurdles in (a) accurately reproducing motion from a target video, and (b) creating diverse visual variations. For example, straightforward extensions of static image customization methods to video often lead to intricate entanglements of appearance and motion data. To tackle this, here we present the Video Motion Customization (VMC) framework, a novel one-shot tuning approach crafted to adapt temporal attention layers within video diffusion models. Our approach introduces a novel motion distillation objective using residual vectors between consecutive frames as a motion reference. The diffusion process then preserves low-frequency motion trajectories while mitigating high-frequency motion-unrelated noise in image space. We validate our method against state-of-the-art video generative models across diverse real-world motions and contexts. Our codes, data and the project demo can be found at https://video-motion-customization.github.io
Community
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- MotionDirector: Motion Customization of Text-to-Video Diffusion Models (2023)
- LAMP: Learn A Motion Pattern for Few-Shot-Based Video Generation (2023)
- Smooth Video Synthesis with Noise Constraints on Diffusion Models for One-shot Video Tuning (2023)
- Space-Time Diffusion Features for Zero-Shot Text-Driven Motion Transfer (2023)
- DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors (2023)
Please give a thumbs up to this comment if you found it helpful!
If you want recommendations for any Paper on Hugging Face checkout this Space
Mastering Video Motions: Deep Dive into VMC with Temporal Attention Adaptation!
Links ๐:
๐ Subscribe: https://www.youtube.com/@Arxflix
๐ Twitter: https://x.com/arxflix
๐ LMNT (Partner): https://lmnt.com/
Models citing this paper 0
No model linking this paper
Datasets citing this paper 0
No dataset linking this paper
Spaces citing this paper 0
No Space linking this paper