Shijia Yang's picture

1 5

Shijia Yang

shijiay

·

AI & ML interests

None yet

Organizations

None yet

shijiay's activity

upvoted a paper 4 months ago

AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand Audio-Visual Information?

Paper • 2412.02611 • Published Dec 3, 2024 • 24

upvoted 3 papers 7 months ago

Law of Vision Representation in MLLMs

Paper • 2408.16357 • Published Aug 29, 2024 • 95

Multitask Vision-Language Prompt Tuning

Paper • 2211.11720 • Published Nov 21, 2022 • 2

HallE-Switch: Rethinking and Controlling Object Existence Hallucinations in Large Vision Language Models for Detailed Caption

Paper • 2310.01779 • Published Oct 3, 2023 • 4