File size: 456 Bytes
ed933c3 |
1 2 3 4 5 6 7 8 9 10 11 12 13 |
---
license: other
license_name: tongyi-qwen
license_link: >-
https://github.com/QwenLM/Qwen/blob/main/Tongyi%20Qianwen%20LICENSE%20AGREEMENT
pipeline_tag: image-text-to-text
---
This repository contains the model described in [Inst-IT: Boosting Multimodal Instance Understanding via Explicit Visual Prompt Instruction Tuning](https://huggingface.co/papers/2412.03565).
Project page: https://inst-it.github.io/
Code: https://github.com/inst-it/inst-it |