Two Giraffes in a Dirt Field: Using Game Play to Investigate Situation Modelling in Large Multimodal Models Paper โข 2406.14035 โข Published Jun 20 โข 12
Qwen2-VL Collection Vision-language model series based on Qwen2 โข 15 items โข Updated Sep 18 โข 151
Vision Language Leaderboards Collection This collection has all the vision language leaderboards. โข 7 items โข Updated Aug 24 โข 10