tsunghanwu/SESAME_minus

	---
	license: mit
	---

	## SESAME_minus

	- Model type: SESAME_minus is an open-source multimodal model trained by fine-tuning LLaVA on instruction-based image grounding (segmentation) data. It is an instruction-based segmentation model that serves as a baseline.
	- Paper or resources for more information: https://see-say-segment.github.io/
	- Where to send questions or comments about the model: https://github.com/see-say-segment/sesame/issues
	- Intended use:
		- Primary intended uses: The primary use of SESAME_minus is research on large multimodal models and chatbots.
		- Primary intended users: The primary intended users of the model are researchers and hobbyists in computer vision, natural language processing, machine learning, and artificial intelligence.
	- Training dataset: RefCOCO, RefCOCO+, and RefCOCOg (abbreviated RefCOCO(+/g))
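
To experiment with the model, the checkpoint files first need to be available locally. The snippet below is a minimal sketch of one way to fetch them, assuming the `huggingface_hub` package is installed and that the repository id is `tsunghanwu/SESAME_minus`. The helper name `download_checkpoint` and the default target directory are illustrative and not part of the original card. This only downloads files; running inference is expected to require the code from the project's GitHub repository.

```python
# Assumed repository id on the Hugging Face Hub.
REPO_ID = "tsunghanwu/SESAME_minus"


def download_checkpoint(local_dir="./SESAME_minus"):
    """Download all files of the model repository and return the local path.

    Hypothetical helper for illustration; it only wraps
    huggingface_hub.snapshot_download.
    """
    # Imported lazily so the module can be loaded without the dependency.
    from huggingface_hub import snapshot_download

    return snapshot_download(repo_id=REPO_ID, local_dir=local_dir)
```

Calling `download_checkpoint()` returns the path of the local folder that holds the downloaded files, which can then be passed to the project's own loading scripts.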