Scenes as Objects, Not Primitives: Instance-Structured 3D Tokenization from Unposed Views Paper • 2606.29513 • Published 5 days ago • 42
Multi-Granular Spatio-Temporal Token Merging for Training-Free Acceleration of Video LLMs Paper • 2507.07990 • Published Jul 10, 2025 • 45