Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Shijia Yang's picture
1 5

Shijia Yang

shijiay
Youhatang's profile picture 21world's profile picture dark-pen's profile picture
·

AI & ML interests

None yet

Organizations

None yet

upvoted a paper 7 months ago

AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand Audio-Visual Information?

Paper • 2412.02611 • Published Dec 3, 2024 • 24
upvoted an article 10 months ago
view article
Article

Key Insights into the Law of Vision Representations in MLLMs

By Borise •
Sep 2, 2024
• 18
upvoted 3 papers 10 months ago

Law of Vision Representation in MLLMs

Paper • 2408.16357 • Published Aug 29, 2024 • 96

Multitask Vision-Language Prompt Tuning

Paper • 2211.11720 • Published Nov 21, 2022 • 2

HallE-Switch: Rethinking and Controlling Object Existence Hallucinations in Large Vision Language Models for Detailed Caption

Paper • 2310.01779 • Published Oct 3, 2023 • 4
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs