16 How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites · 27 authors
4 Revisiting Text-to-Image Evaluation with Gecko: On Metrics, Prompts, and Human Ratings · 11 authors
4 SEED-Bench-2-Plus: Benchmarking Multimodal Large Language Models with Text-Rich Visual Comprehension · 6 authors