DELTA: Dense Efficient Long-range 3D Tracking for any video Paper • 2410.24211 • Published 26 days ago • 8
Learning Video Representations without Natural Videos Paper • 2410.24213 • Published 26 days ago • 14
A Pointer Network-based Approach for Joint Extraction and Detection of Multi-Label Multi-Class Intents Paper • 2410.22476 • Published 28 days ago • 24
Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders Paper • 2410.22366 • Published 29 days ago • 74
Toxicity of the Commons: Curating Open-Source Pre-Training Data Paper • 2410.22587 • Published 27 days ago • 8
SlowFast-VGen: Slow-Fast Learning for Action-Driven Long Video Generation Paper • 2410.23277 • Published 27 days ago • 7
ShadowKV: KV Cache in Shadows for High-Throughput Long-Context LLM Inference Paper • 2410.21465 • Published 29 days ago • 10
LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior Paper • 2410.21264 • Published 29 days ago • 8