Yue Fan

yfan1997

yfan1997's activity

updated a dataset 1 day ago

yfan1997/mm-cot-data

Viewer • Updated 1 day ago • 329 • 45

published a model 2 days ago

yfan1997/mm-cot-data

Updated 2 days ago

published a dataset 6 days ago

yfan1997/gqa

Updated 6 days ago • 29

published a dataset 7 days ago

yfan1997/mm-cot-data

Viewer • Updated 1 day ago • 329 • 45

upvoted a paper 18 days ago

Multimodal Inconsistency Reasoning (MMIR): A New Benchmark for Multimodal Reasoning Models

Paper • 2502.16033 • Published 21 days ago • 16

updated a dataset 5 months ago

yfan1997/AVDN

Viewer • Updated Oct 9, 2024 • 3.06k • 64

updated a dataset 8 months ago

yfan1997/ScreenPR

Viewer • Updated Jul 17, 2024 • 650 • 364 • 6

authored 5 papers 9 months ago

JARVIS: A Neuro-Symbolic Commonsense Reasoning Framework for Conversational Embodied Agents

Paper • 2208.13266 • Published Aug 28, 2022 • 1

Muffin or Chihuahua? Challenging Large Vision-Language Models with Multipanel VQA

Paper • 2401.15847 • Published Jan 29, 2024 • 2

MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos

Paper • 2406.08407 • Published Jun 12, 2024 • 28

LLM-Coordination: Evaluating and Analyzing Multi-agent Coordination Abilities in Large Language Models

Paper • 2310.03903 • Published Oct 5, 2023

Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding

Paper • 2406.19263 • Published Jun 27, 2024 • 10

upvoted a paper 9 months ago

Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding

Paper • 2406.19263 • Published Jun 27, 2024 • 10

updated a dataset 11 months ago

yfan1997/MultipanelVQA_synthetic

Viewer • Updated Apr 7, 2024 • 2.1k • 134 • 1

New activity in WildVision/vision-arena about 1 year ago

Finding hard tasks for vision models, though easy for humans: MAD magazine 'fold-ins'

#3 opened about 1 year ago by reddgr

updated 2 datasets about 1 year ago

yfan1997/MultipanelVQA_real-world

Viewer • Updated Jan 31, 2024 • 100 • 324 • 4

yfan1997/test

Viewer • Updated Jan 28, 2024 • 100 • 256

New activity in jinggu/MultipanelVQA about 1 year ago

Upload 100 files

#29 opened about 1 year ago by yfan1997

Update README.md

#28 opened about 1 year ago by yfan1997

Update README.md

#27 opened about 1 year ago by yfan1997