zhaoyuzhong

callsys
AI & ML interests

computer vision

Organizations

None yet

callsys's activity

upvoted 2 papers 7 months ago

AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand Audio-Visual Information?

Paper • 2412.02611 • Published Dec 3, 2024 • 24

Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model

Paper • 2411.19108 • Published Nov 28, 2024 • 19

upvoted a paper about 1 year ago

DynRefer: Delving into Region-level Multi-modality Tasks via Dynamic Resolution

Paper • 2405.16071 • Published May 25, 2024 • 2