adhisetiawan's Collections

Multimodal Models

updated May 27, 2024

  • microsoft/kosmos-2-patch14-224

    Image-to-Text • Updated Nov 28, 2023 • 179k • 162

  • Tyrannosaurus/TinyGPT-V

    Updated Jan 19, 2024 • 50

  • naver-clova-ix/donut-base

    Image-to-Text • Updated Aug 13, 2022 • 46k • 209

  • llava-hf/llava-v1.6-34b-hf

    Image-Text-to-Text • Updated Jan 27 • 3.08k • 82

  • deepseek-ai/deepseek-vl-7b-base

    Updated Mar 15, 2024 • 1.06k • 59

  • deepseek-ai/deepseek-vl-7b-chat

    Image-Text-to-Text • Updated Mar 15, 2024 • 7.3k • 256

  • vikhyatk/moondream2

    Image-Text-to-Text • Updated Apr 14 • 370k • 1.13k

  • THUDM/cogvlm-chat-hf

    Text Generation • Updated Dec 19, 2023 • 3.42k • 194

  • Qwen/Qwen-VL-Chat

    Text Generation • Updated Jan 25, 2024 • 42.7k • 365

  • Qwen/Qwen-VL

    Text Generation • Updated Jan 25, 2024 • 20.3k • 245

  • microsoft/git-base

    Image-to-Text • Updated Apr 24, 2023 • 289k • 93