Are Vision-Language Models Truly Understanding Multi-vision Sensor? Paper • 2412.20750 • Published 6 days ago • 15 • 2
SPARK: Multi-Vision Sensor Perception and Reasoning Benchmark for Large-scale Vision-Language Models Paper • 2408.12114 • Published Aug 22, 2024 • 12 • 3