Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
MBZUAI 's Collections
GeoPixel
BiMediX2
ArTST - Arabic Text Speech Transformer
VideoGPT+
GLaMM
Video-ChatGPT
LLaVA++ (LLaMA-3 and Phi-3-Mini)
PALO
MobiLlama
GeoChat
Satmae++

GLaMM

updated Jun 11, 2024

Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated.

Upvote
4

  • MBZUAI/GLaMM-FullScope

    Text Generation • Updated Apr 27, 2024 • 556 • 6

  • MBZUAI/GranD

    Updated Apr 17, 2024 • 213 • 13

  • MBZUAI/GranD-f

    Preview • Updated Mar 21, 2024 • 38 • 11

  • MBZUAI/GLaMM-GranD-Pretrained

    Text Generation • Updated Dec 26, 2023 • 14.9k • 3

  • MBZUAI/GLaMM-FullScope_v0

    Text Generation • Updated Mar 31, 2024 • 88

  • MBZUAI/GLaMM-GCG

    Text Generation • Updated Dec 27, 2023 • 12 • 1

  • MBZUAI/GLaMM-RefSeg

    Text Generation • Updated Dec 26, 2023 • 28 • 1

  • MBZUAI/GLaMM-RegCap-RefCOCOg

    Text Generation • Updated Dec 26, 2023 • 20 • 1

  • MBZUAI/GLaMM-RegCap-VG

    Text Generation • Updated Dec 26, 2023 • 3
Upvote
4
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs