20 59 26

HAODONG DUAN

KennyUTC

https://kennymckormick.github.io

AI & ML interests

Video Understanding; Multi-Modal Learning

Recent Activity

upvoted a paper 7 days ago

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

authored a paper 11 days ago

MM-IFEngine: Towards Multimodal Instruction Following

upvoted a paper 11 days ago

VCR-Bench: A Comprehensive Evaluation Framework for Video Chain-of-Thought Reasoning

View all activity

Organizations

KennyUTC's activity

upvoted a paper 7 days ago

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published 8 days ago • 237

authored a paper 11 days ago

MM-IFEngine: Towards Multimodal Instruction Following

Paper • 2504.07957 • Published 12 days ago • 34

upvoted 2 papers 11 days ago

VCR-Bench: A Comprehensive Evaluation Framework for Video Chain-of-Thought Reasoning

Paper • 2504.07956 • Published 12 days ago • 45

MM-IFEngine: Towards Multimodal Instruction Following

Paper • 2504.07957 • Published 12 days ago • 34

upvoted a paper 13 days ago

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published 15 days ago • 168

updated a dataset 14 days ago

VLMEval/OpenVLMRecords

Updated 14 days ago • 510 • 6

authored a paper 18 days ago

Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing

Paper • 2504.02826 • Published 19 days ago • 67

upvoted a paper 19 days ago

Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing

Paper • 2504.02826 • Published 19 days ago • 67

commented a paper 19 days ago

Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing

Paper • 2504.02826 • Published 19 days ago • 67 •

authored a paper 26 days ago

LEGO-Puzzles: How Good Are MLLMs at Multi-Step Spatial Reasoning?

Paper • 2503.19990 • Published 28 days ago • 34

upvoted a paper 26 days ago

LEGO-Puzzles: How Good Are MLLMs at Multi-Step Spatial Reasoning?

Paper • 2503.19990 • Published 28 days ago • 34

commented a paper 26 days ago

LEGO-Puzzles: How Good Are MLLMs at Multi-Step Spatial Reasoning?

Paper • 2503.19990 • Published 28 days ago • 34 •

liked 3 datasets 27 days ago

authored a paper about 1 month ago

Creation-MMBench: Assessing Context-Aware Creative Intelligence in MLLM

Paper • 2503.14478 • Published Mar 18 • 47

upvoted 2 papers about 1 month ago

Creation-MMBench: Assessing Context-Aware Creative Intelligence in MLLM

Paper • 2503.14478 • Published Mar 18 • 47

VisualPRM: An Effective Process Reward Model for Multimodal Reasoning

Paper • 2503.10291 • Published Mar 13 • 35

authored a paper about 1 month ago

VisualPRM: An Effective Process Reward Model for Multimodal Reasoning

Paper • 2503.10291 • Published Mar 13 • 35

upvoted a paper about 1 month ago

Unified Reward Model for Multimodal Understanding and Generation

Paper • 2503.05236 • Published Mar 7 • 119