deepvk/llava-saiga-8b
Image-Text-to-Text
Our datasets and models for Visual-Language Modeling
Note: Vision-language model based on Saiga-8b, tuned in the LLaVA setup
Note: Vision-language model based on Gemma-2b, tuned with LoRA in the LLaVA setup
Note: GPT-based instruction dataset for LLaVA-style training
Note: Translated version of the GQA benchmark
Note: Translated version of the MMBench benchmark