tsunghanwu committed on
Commit
40f5bfe
1 Parent(s): a2765dc

Update README.md

  1. README.md +19 -3
README.md CHANGED
@@ -1,3 +1,19 @@
- ---
- license: mit
- ---
+ ---
+ license: mit
+ ---
+
+ SESAME Model Card
+
+ Model details
+ Model type: SESAME is an open-source multimodal model trained by fine-tuning LLaVA on various instruction-based image grounding (segmentation) data. It is an auto-regressive language model plus a segmentation model.
+
+ Paper or resources for more information: https://see-say-segment.github.io/
+
+ Where to send questions or comments about the model: https://github.com/see-say-segment/sesame/issues
+
+ Intended use
+ Primary intended uses: The primary use of SESAME is research on large multimodal models and chatbots.
+
+ Primary intended users: The primary intended users of the model are researchers and hobbyists in computer vision, natural language processing, machine learning, and artificial intelligence.
+
+ Training dataset: (FP-/R-)RefCOCO(+/g) + LLaVA 150K VQA data