SliME Model Card

Model details

Model type:

SliME is an open-source chatbot trained by fine-tuning LLM on multimodal instruction-following data. It is an auto-regressive language model, based on the transformer architecture. Base LLM: meta-llama/Meta-Llama-3-8B-Instruct

Paper or resources for more information: https://github.com/yfzhang114/SliME

License

Where to send questions or comments about the model: https://github.com/yfzhang114/SliME/issues

Intended use

Primary intended uses: The primary use of SliME is research on large multimodal models and chatbots.

Primary intended users: The primary intended users of the model are researchers and hobbyists in computer vision, natural language processing, machine learning, and artificial intelligence.

Training dataset

SharedGPT4v sft data
SMR data

Evaluation dataset

A collection of 15 benchmarks, including 5 academic VQA benchmarks and 10 recent benchmarks specifically proposed for instruction-following LMMs.

yifanzhang114
/

SliME-Llama3-8B-lora

SliME Model Card

Model details

License

Intended use

Training dataset

Evaluation dataset

Dataset used to train yifanzhang114/SliME-Llama3-8B-lora

Collection including yifanzhang114/SliME-Llama3-8B-lora

SliME