Imaginative Perception Tokens Enhance Spatial Reasoning in Multimodal Language Models Paper • 2606.03988 • Published 8 days ago • 114
RayDer: Scalable Self-Supervised Novel View Synthesis from Real-World Video Paper • 2605.31535 • Published 13 days ago • 7
Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players Paper • 2605.28816 • Published 15 days ago • 423