File size: 456 Bytes

ed933c3

---
license: other
license_name: tongyi-qwen
license_link: >-
  https://github.com/QwenLM/Qwen/blob/main/Tongyi%20Qianwen%20LICENSE%20AGREEMENT
pipeline_tag: image-text-to-text
---

This repository contains the model described in [Inst-IT: Boosting Multimodal Instance Understanding via Explicit Visual Prompt Instruction Tuning](https://huggingface.co/papers/2412.03565).

Project page: https://inst-it.github.io/

Code: https://github.com/inst-it/inst-it