DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation • Paper • 2412.18597 • Published 10 days ago • 19
Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts • Paper • 2403.08268 • Published Mar 13, 2024 • 15
TIP: Text-Driven Image Processing with Semantic and Restoration Instructions • Paper • 2312.11595 • Published Dec 18, 2023 • 5
AnimateZero: Video Diffusion Models are Zero-Shot Image Animators • Paper • 2312.03793 • Published Dec 6, 2023 • 17
MagicStick: Controllable Video Editing via Control Handle Transformations • Paper • 2312.03047 • Published Dec 5, 2023 • 9
SPAE: Semantic Pyramid AutoEncoder for Multimodal Generation with Frozen LLMs • Paper • 2306.17842 • Published Jun 30, 2023 • 9