Multimodal Language Model Benchmarks - a sherzod-hakimov Collection

sherzod-hakimov 's Collections

Multimodal Language Model Benchmarks

Multimodal Language Model Benchmarks

updated Sep 11

Multimodal benchmarks that test various aspects of LLMs, VLMs, LMMs

Running

3

🏆

Multimodal Clembench
Running

81

🏆

SEED-Bench Leaderboard
MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI

Paper • 2311.16502 • Published Nov 27, 2023 • 35

Note Leaderboard: https://mmmu-benchmark.github.io/#leaderboard
MMMU-Pro: A More Robust Multi-discipline Multimodal Understanding Benchmark

Paper • 2409.02813 • Published Sep 4 • 28
CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs

Paper • 2406.18521 • Published Jun 26 • 28

Note Leaderboard: https://charxiv.github.io/?s=09#leaderboard
FanqingM/MMIU-Benchmark

Viewer • Updated Aug 8 • 11.7k • 133 • 6

Note Leaderboard: https://mmiu-bench.github.io/#leaderboard
CMMMU: A Chinese Massive Multi-discipline Multimodal Understanding Benchmark

Paper • 2401.11944 • Published Jan 22 • 27

Note Leaderboard: https://cmmmu-benchmark.github.io/#leaderboard
Running on CPU Upgrade

536

🌎

Open VLM Leaderboard

VLMEvalKit Evaluation Results Collection
GMAI-MMBench: A Comprehensive Multimodal Evaluation Benchmark Towards General Medical AI

Paper • 2408.03361 • Published Aug 6 • 85

Note Dataset: https://huggingface.co/datasets/OpenGVLab/GMAI-MMBench
Running

532

🖼💬

Vision Arena (Testing VLMs side-by-side)
Running

23

🥇

MM-UPD Leaderboard
Running

93

🥇

Vidore Leaderboard
Running

18

🚀

MMBench Leaderboard
topyun/SPARK

Viewer • Updated Aug 23 • 6.25k • 94 • 15