Mind with Eyes: from Language Reasoning to Multimodal Reasoning Paper • 2503.18071 • Published 16 days ago • 3
DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding Paper • 2503.12797 • Published 22 days ago • 29