Running
3
π
Multimodal benchmarks that test various aspects of LLMs, VLMs, LMMs
Note Leaderboard: https://mmmu-benchmark.github.io/#leaderboard
Note Leaderboard: https://charxiv.github.io/?s=09#leaderboard
Note Leaderboard: https://mmiu-bench.github.io/#leaderboard
Note Leaderboard: https://cmmmu-benchmark.github.io/#leaderboard
VLMEvalKit Evaluation Results Collection
Note Dataset: https://huggingface.co/datasets/OpenGVLab/GMAI-MMBench