tsunghanwu committed on
Commit
1c9158c
1 Parent(s): 40f5bfe

Update README.md

Files changed (1)
  1. README.md +9 -15
README.md CHANGED
@@ -2,18 +2,12 @@
  license: mit
  ---
 
- SESAME Model Card
-
- Model details
- Model type: SESAME is an open-source multimodal model trained by fine-tuning LLaVA on various instruction-based image grounding (segmentation) data. It is an auto-regressive language model plus a segmentation model.
-
- Paper or resources for more information: https://see-say-segment.github.io/
-
- Where to send questions or comments about the model: https://github.com/see-say-segment/sesame/issues
-
- Intended use
- Primary intended uses: The primary use of SESAME is research on large multimodal models and chatbots.
-
- Primary intended users: The primary intended users of the model are researchers and hobbyists in computer vision, natural language processing, machine learning, and artificial intelligence.
-
- Training dataset: (FP-/R-)RefCOCO(+/g) + LLaVA 150K VQA data
+ ## SESAME
+
+ - Model type: SESAME is an open-source multimodal model trained by fine-tuning LLaVA on various instruction-based image grounding (segmentation) data. It is an auto-regressive language model plus a segmentation model.
+ - Paper or resources for more information: https://see-say-segment.github.io/
+ - Where to send questions or comments about the model: https://github.com/see-say-segment/sesame/issues
+ - Intended use
+ - Primary intended uses: The primary use of SESAME is research on large multimodal models and chatbots.
+ - Primary intended users: The primary intended users of the model are researchers and hobbyists in computer vision, natural language processing, machine learning, and artificial intelligence.
+ - Training dataset: (FP-/R-)RefCOCO(+/g) + LLaVA 150K VQA data