Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
srisree 's Collections
Multimodal

Multimodal

updated Mar 12
Upvote
-

  • GPT-4V in Wonderland: Large Multimodal Models for Zero-Shot Smartphone GUI Navigation

    Paper • 2311.07562 • Published Nov 13, 2023 • 15

  • VACE: All-in-One Video Creation and Editing

    Paper • 2503.07598 • Published Mar 10 • 49
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs