39 LLaVA-Interactive: An All-in-One Demo for Image Chat, Segmentation, Generation and Editing · 5 authors 10
7 Grounding Visual Illusions in Language: Do Vision-Language Models Perceive Illusions Like Humans? · 5 authors 1