HAODONG DUAN

KennyUTC

AI & ML interests

Video Understanding; Multi-Modal Learning

Posts 1

The Open VLM Leaderboard has just been updated with the performance of GPT-4v (20240409); the new proprietary model ranks 1st among 50+ VLMs. Compared to the previous version (20231106), the improvements in both multimodal perception and reasoning are huge.

Check the results:
opencompass/open_vlm_leaderboard

Models

None public yet

Datasets

None public yet