InternVL1.0 - a OpenGVLab Collection

OpenGVLab 's Collections

InternVL2.5-MPO

V2PE

InternVL Adaptation

All-Seeing Project

PVT v2

InternVL1.0

updated 3 days ago

Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks

InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks

Paper • 2312.14238 • Published Dec 21, 2023 • 18

Note CVPR 2024, Oral
OpenGVLab/InternViT-6B-224px

Image Feature Extraction • Updated 15 days ago • 577 • 23
OpenGVLab/InternVL-14B-224px

Image Feature Extraction • Updated 15 days ago • 4.64k • 36
OpenGVLab/InternVL-Chat-V1-2-Plus

Image-Text-to-Text • Updated 6 days ago • 255 • 35

Note Relased at 2024.02.21 | 40B parameters | More SFT data and stronger.
OpenGVLab/InternVL-Chat-V1-2

Image-Text-to-Text • Updated 6 days ago • 235 • 19

Note Released at 2024.02.11 | 40B parameters | scaling up LLM to 34B.
OpenGVLab/InternVL-Chat-V1-1

Image-Text-to-Text • Updated 6 days ago • 233 • 14

Note Released at 2024.01.24 | 19B parameters | support Chinese and stronger OCR
OpenGVLab/InternViT-6B-448px-V1-2

Image Feature Extraction • Updated 15 days ago • 172 • 27

Note Released at 2024.02.11 | Vision Foundation Model | 448 resolution
OpenGVLab/InternViT-6B-448px-V1-0

Image Feature Extraction • Updated 15 days ago • 40 • 10

Note Released at 2024.01.30 | Vision Foundation Model | 448 resolution
OpenGVLab/InternVL-14B-Flickr30K-FT-364px

Feature Extraction • Updated Aug 24 • 15 • 8
OpenGVLab/InternVL-14B-FlickrCN-FT-364px

Updated Aug 24 • 15 • 4
OpenGVLab/InternVL-Chat-ViT-6B-Vicuna-7B

Visual Question Answering • Updated Aug 24 • 31 • 9
OpenGVLab/InternVL-Chat-ViT-6B-Vicuna-13B

Visual Question Answering • Updated Aug 24 • 21 • 8
OpenGVLab/InternVL-Chat-ViT-6B-Vicuna-13B-448px

Visual Question Answering • Updated Aug 24 • 12 • 5
OpenGVLab/InternVL

Updated Oct 24 • 25