Article MotionLCM-V2: Improved Compression Rate for Multi-Latent-Token Diffusion By wxDai • 22 days ago • 12
TAPTRv3: Spatial and Temporal Context Foster Robust Tracking of Any Point in Long Video Paper • 2411.18671 • Published Nov 27, 2024 • 20
DINO-X: A Unified Vision Model for Open-World Object Detection and Understanding Paper • 2411.14347 • Published Nov 21, 2024 • 13 • 2
MotionCLR: Motion Generation and Training-free Editing via Understanding Attention Mechanisms Paper • 2410.18977 • Published Oct 24, 2024 • 14
MotionCLR: Motion Generation and Training-free Editing via Understanding Attention Mechanisms Paper • 2410.18977 • Published Oct 24, 2024 • 14 • 2
Article Introducing MotionCLR: Interactive Motion Editing By EvanTHU • Oct 24, 2024 • 1