Sequential Modeling Enables Scalable Learning for Large Vision Models Paper • 2312.00785 • Published Dec 1, 2023 • 1
EgoPet: Egomotion and Interaction Data from an Animal's Perspective Paper • 2404.09991 • Published Apr 15, 2024
Forgotten Polygons: Multimodal Large Language Models are Shape-Blind Paper • 2502.15969 • Published Feb 21 • 2
Diffusion Hyperfeatures: Searching Through Time and Space for Semantic Correspondence Paper • 2305.14334 • Published May 23, 2023 • 1
Readout Guidance: Learning Control from Diffusion Features Paper • 2312.02150 • Published Dec 4, 2023 • 3