HAODONG DUAN's picture

HAODONG DUAN

KennyUTC

·

https://kennymckormick.github.io

AI & ML interests

Video Understanding; Multi-Modal Learning

Recent Activity

authored a paper 3 days ago

MM-IFEngine: Towards Multimodal Instruction Following

upvoted a paper 3 days ago

VCR-Bench: A Comprehensive Evaluation Framework for Video Chain-of-Thought Reasoning

upvoted a paper 3 days ago

MM-IFEngine: Towards Multimodal Instruction Following

View all activity

Organizations

KennyUTC's activity

commented a paper 10 days ago

Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing

Paper • 2504.02826 • Published 10 days ago • 67 •

commented a paper 18 days ago

LEGO-Puzzles: How Good Are MLLMs at Multi-Step Spatial Reasoning?

Paper • 2503.19990 • Published 19 days ago • 33 •

commented a paper 3 months ago

Redundancy Principles for MLLMs Benchmarks

Paper • 2501.13953 • Published Jan 20 • 29 •

New activity in opencompass/open_vlm_leaderboard 6 months ago

Discrepancy between listed and own accuracy of LLaVA-Onevision-7b-ov on BLINK benchmark

#11 opened 6 months ago by

commented a paper 8 months ago

GMAI-MMBench: A Comprehensive Multimodal Evaluation Benchmark Towards General Medical AI

Paper • 2408.03361 • Published Aug 6, 2024 • 87 •

New activity in opencompass/MMBench 8 months ago

The leaderboard is not working...

#1 opened 10 months ago by

New activity in opencompass/open_vlm_leaderboard 10 months ago

Add paper link

#10 opened 10 months ago by

commented 2 papers 10 months ago

Prism: A Framework for Decoupling and Assessing the Capabilities of VLMs

Paper • 2406.14544 • Published Jun 20, 2024 • 36 •

MMBench-Video: A Long-Form Multi-Shot Benchmark for Holistic Video Understanding

Paper • 2406.14515 • Published Jun 20, 2024 • 34 •

New activity in opencompass/open_vlm_leaderboard 11 months ago

Can we add https://github.com/NVlabs/RADIO

#9 opened 12 months ago by

New activity in opencompass/open_vlm_leaderboard 12 months ago

About the OCRBench

#8 opened 12 months ago by

New activity in opencompass/open_vlm_leaderboard about 1 year ago

This leaderboard is broken...

#7 opened about 1 year ago by

`ScienceQA_IMG` results are not available

#3 opened about 1 year ago by

`COCO Caption` results are not available

#4 opened about 1 year ago by

The provenance linkage is not available after clicking on the model name...

#6 opened about 1 year ago by

Great work!

#2 opened about 1 year ago by

Difference between this and Multi-modal Modal Leaderboard?

#1 opened about 1 year ago by

New activity in 01-ai/Yi-VL-34B about 1 year ago

[Demo] VLMEvalKit now supported demo and evaluation for Yi-VL

#10 opened about 1 year ago by

New activity in 01-ai/Yi-VL-6B about 1 year ago

[Demo] VLMEvalKit now supported demo and evaluation for Yi-VL

#7 opened about 1 year ago by