JeasLee committed on
Commit 7296bce · verified · 1 Parent(s): cc5adc9

Update README.md

  1. README.md +28 -0
README.md CHANGED
@@ -1,3 +1,31 @@
+ ---
+ license: apache-2.0
+ base_model:
+ - Qwen/Qwen2.5-VL-3B-Instruct
+ - Qwen/Qwen2.5-VL-7B-Instruct
+ - lmms-lab/llava-onevision-qwen2-7b-ov
+ tags:
+ - robotics
+ - vision-language-action-model
+ - vision-language-model
+ library_name: transformers
+
+ # Collection Metadata (Referencing InternRobotics/VLN-PE style)
+ repo: InternRobotics/RoboInter-VLM
+ type: "checkpoint-collection"
+ description: "Collection of RoboInterVLM checkpoints and configs fine-tuned on RoboInter-VQA."
+ checkpoints:
+ - name: RoboInterVLM_qwenvl25_3b
+   path: RoboInterVLM_qwenvl25_3b/
+   notes: "Lightweight Qwen2.5-VL model"
+ - name: RoboInterVLM_qwenvl25_7b
+   path: RoboInterVLM_qwenvl25_7b/
+   notes: "Stronger performance Qwen2.5-VL backbone"
+ - name: RoboInterVLM_llava_one_vision_7B
+   path: RoboInterVLM_llava_one_vision_7B/
+   notes: "LLaVA-OneVision (SigLIP + Qwen2) backbone"
+ ---
+
  # RoboInterVLM: Vision-Language Model Checkpoints for RoboInter Manipulation Suite

  Model checkpoints of **RoboInterVLM**, developed as part of the [RoboInter](https://github.com/InternRobotics/RoboInter) project. These models are fine-tuned on the [RoboInter-VQA](https://huggingface.co/datasets/InternRobotics/RoboInter-VQA) dataset for intermediate representation understanding and generation in robotic manipulation.
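
Because the front matter lists each checkpoint under its own `path`, one checkpoint can probably be fetched without pulling the whole collection. The following is a minimal sketch, not the project's documented workflow. It assumes the repository's folder layout matches the `path` entries above and that the `huggingface_hub` package is installed. The short keys in `CHECKPOINTS` and both helper function names are hypothetical, introduced only for illustration.

```python
# Folder names are taken from the `checkpoints` entries in the front matter.
# The dictionary keys are hypothetical short aliases, not part of the repo.
CHECKPOINTS = {
    "qwenvl25_3b": "RoboInterVLM_qwenvl25_3b",
    "qwenvl25_7b": "RoboInterVLM_qwenvl25_7b",
    "llava_one_vision_7b": "RoboInterVLM_llava_one_vision_7B",
}

# Repo id as given by the `repo` field in the front matter.
REPO_ID = "InternRobotics/RoboInter-VLM"


def checkpoint_patterns(key: str) -> list[str]:
    """Return glob patterns matching the files under one checkpoint folder."""
    return [f"{CHECKPOINTS[key]}/*"]


def download_checkpoint(key: str, local_dir: str = "checkpoints") -> str:
    """Download one checkpoint folder and return the local directory path."""
    # Imported lazily so the helpers above work without huggingface_hub.
    from huggingface_hub import snapshot_download

    return snapshot_download(
        repo_id=REPO_ID,
        allow_patterns=checkpoint_patterns(key),  # assumes this folder layout
        local_dir=local_dir,
    )
```

Calling `download_checkpoint("qwenvl25_3b")` would then place only the 3B folder under `checkpoints/`, assuming the layout holds. How the weights are loaded afterwards depends on the backbone and is not specified by this commit.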