InternVL1.0 - a OpenGVLab Collection

OpenGVLab 's Collections

PIIP

VideoChat-Flash

InternVL2.5-MPO

V2PE

InternVL Adaptation

All-Seeing Project

PVT v2

InternVL1.0

updated 22 days ago

Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks

InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks

Paper • 2312.14238 • Published Dec 21, 2023 • 20

Note CVPR 2024, Oral
OpenGVLab/InternViT-6B-224px

Image Feature Extraction • Updated Dec 9, 2024 • 164 • 23
OpenGVLab/InternVL-14B-224px

Image Feature Extraction • Updated Dec 9, 2024 • 436 • 35
OpenGVLab/InternVL-Chat-V1-2-Plus

Image-Text-to-Text • Updated Mar 25 • 327 • 35

Note Relased at 2024.02.21 | 40B parameters | More SFT data and stronger.
OpenGVLab/InternVL-Chat-V1-2

Image-Text-to-Text • Updated Mar 25 • 364 • 17

Note Released at 2024.02.11 | 40B parameters | scaling up LLM to 34B.
OpenGVLab/InternVL-Chat-V1-1

Image-Text-to-Text • Updated Mar 25 • 553 • 13

Note Released at 2024.01.24 | 19B parameters | support Chinese and stronger OCR
OpenGVLab/InternViT-6B-448px-V1-2

Image Feature Extraction • Updated Dec 9, 2024 • 361 • 25

Note Released at 2024.02.11 | Vision Foundation Model | 448 resolution
OpenGVLab/InternViT-6B-448px-V1-0

Image Feature Extraction • Updated Dec 9, 2024 • 21 • 8

Note Released at 2024.01.30 | Vision Foundation Model | 448 resolution
OpenGVLab/InternVL-14B-Flickr30K-FT-364px

Feature Extraction • Updated Aug 24, 2024 • 34 • 7
OpenGVLab/InternVL-14B-FlickrCN-FT-364px

Updated Aug 24, 2024 • 1 • 3
OpenGVLab/InternVL-Chat-ViT-6B-Vicuna-7B

Visual Question Answering • Updated Aug 24, 2024 • 121 • 8
OpenGVLab/InternVL-Chat-ViT-6B-Vicuna-13B

Visual Question Answering • Updated Aug 24, 2024 • 23 • 7
OpenGVLab/InternVL-Chat-ViT-6B-Vicuna-13B-448px

Visual Question Answering • Updated Aug 24, 2024 • 15 • 4
OpenGVLab/InternVL

Updated Dec 25, 2024 • 34