HAODONG DUAN

KennyUTC

https://kennymckormick.github.io

AI & ML interests

Video Understanding; Multi-Modal Learning

Recent Activity

authored a paper 3 days ago

MM-IFEngine: Towards Multimodal Instruction Following

upvoted a paper 3 days ago

VCR-Bench: A Comprehensive Evaluation Framework for Video Chain-of-Thought Reasoning

upvoted a paper 3 days ago

MM-IFEngine: Towards Multimodal Instruction Following

Organizations

Posts 2

Open VLM Leaderboard just released the full evaluation results of GPT-4o

[TL;DR]
GPT-4o shows steady progress over GPT-4V (0419), improving the average score by about 3.4 percentage points (68.7% -> 72.1%). It also displays stronger perception and fewer hallucinations.

opencompass/open_vlm_leaderboard

models

None public yet

datasets

None public yet

Articles 2

Claude-3.5 Evaluation Results on Open VLM Leaderboard

Papers 36

spaces 1

BotChat
