Perception Tokens Enhance Visual Reasoning in Multimodal Language Models Paper • 2412.03548 • Published 17 days ago • 16