Coarse Correspondence Elicit 3D Spacetime Understanding in Multimodal Language Model Paper • 2408.00754 • Published Aug 1 • 21
COLA: How to adapt vision-language models to Compose Objects Localized with Attributes? Paper • 2305.03689 • Published May 5, 2023 • 2
COLA: How to adapt vision-language models to Compose Objects Localized with Attributes? Paper • 2305.03689 • Published May 5, 2023 • 2 • 1