# 🔥🏅️GenCeption Leaderboard 🏅️🔥

Evaluated MLLMs: [ChatGPT-4V](https://cdn.openai.com/papers/GPTV_System_Card.pdf), [mPLUG-Owl2](https://arxiv.org/pdf/2311.04257.pdf), [LLaVA-13B](https://arxiv.org/pdf/2304.08485.pdf), [LLaVA-7B](https://arxiv.org/pdf/2304.08485.pdf)


**Existence**

| Model | GC@3 |
|--|--|
| ChatGPT-4V | 0.422 |
| mPLUG-Owl2 | 0.323 |
| LLaVA-7B | 0.308 |
| LLaVA-13B | 0.305 |

**Count**

| Model | GC@3 |
|--|--|
| ChatGPT-4V | 0.404 |
| mPLUG-Owl2 | 0.299 |
| LLaVA-13B | 0.294 |
| LLaVA-7B | 0.353 |


**Position**

| Model | GC@3 |
|--|--|
| ChatGPT-4V | 0.408 |
| mPLUG-Owl2 | 0.306 |
| LLaVA-7B | 0.285 |
| LLaVA-13B | 0.255 |

**Color**

| Model | GC@3 |
|--|--|
| ChatGPT-4V | 0.403 |
| LLaVA-13B | 0.300 |
| mPLUG-Owl2 | 0.290 |
| LLaVA-7B | 0.284 |


**Poster**

| Model | GC@3 |
|--|--|
| ChatGPT-4V | 0.324 |
| mPLUG-Owl2 | 0.243 |
| LLaVA-13B | 0.215 |
| LLaVA-7B | 0.214 |

**Celebrity**

| Model | GC@3 |
|--|--|
| ChatGPT-4V | 0.332 |
| mPLUG-Owl2 | 0.232 |
| LLaVA-13B | 0.206 |
| LLaVA-7B | 0.188 |


**Scene**

| Model | GC@3 |
|--|--|
| ChatGPT-4V | 0.393 |
| mPLUG-Owl2 | 0.299 |
| LLaVA-13B | 0.277 |
| LLaVA-7B | 0.266 |

**Landmark**

| Model | GC@3 |
|--|--|
| ChatGPT-4V | 0.353 |
| mPLUG-Owl2 | 0.275 |
| LLaVA-7B | 0.252 |
| LLaVA-13B | 0.242 |


**Artwork**

| Model | GC@3 |
|--|--|
| ChatGPT-4V | 0.421 |
| mPLUG-Owl2 | 0.252 |
| LLaVA-13B | 0.212 |
| LLaVA-7B | 0.210 |

**Commonsense Reasoning**

| Model | GC@3 |
|--|--|
| ChatGPT-4V | 0.471 |
| mPLUG-Owl2 | 0.353 |
| LLaVA-13B | 0.334 |
| LLaVA-7B | 0.294 |


**Code Reasoning**

| Model | GC@3 |
|--|--|
| ChatGPT-4V | 0.193 |
| mPLUG-Owl2 | 0.176 |
| LLaVA-13B | 0.144 |
| LLaVA-7B | 0.107 |

**Numerical Calculation**

| Model | GC@3 |
|--|--|
| ChatGPT-4V | 0.240 |
| LLaVA-13B | 0.195 |
| mPLUG-Owl2 | 0.192 |
| LLaVA-7B | 0.155 |


**Text Translation**

| Model | GC@3 |
|--|--|
| ChatGPT-4V | 0.157 |
| LLaVA-13B | 0.116 |
| LLaVA-7B | 0.111 |
| mPLUG-Owl2 | 0.081 |

**OCR**

| Model | GC@3 |
|--|--|
| ChatGPT-4V | 0.393 |
| mPLUG-Owl2 | 0.276 |
| LLaVA-13B | 0.239 |
| LLaVA-7B | 0.222 |
