Text Generation
Transformers
PyTorch
English
llava
Inference Endpoints
Edit model card

This is a preview version of the Q-Instruct LLaVA. Non-finalized weights.

@misc{wu2023qinstruct,
      title={Q-Instruct: Improving Low-level Visual Abilities for Multi-modality Foundation Models}, 
      author={Haoning Wu and Zicheng Zhang and Erli Zhang and Chaofeng Chen and Liang Liao and Annan Wang and Kaixin Xu and Chunyi Li and Jingwen Hou and Guangtao Zhai and Geng Xue and Wenxiu Sun and Qiong Yan and Weisi Lin},
      year={2023},
      eprint={2311.06783},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}
Downloads last month
103

Datasets used to train teowu/llava_v1.5_7b_qinstruct_preview_v0.1