Citation

If you use this finetuned model checkpoint in your research, please cite our paper as follows:

    @misc{zhang2024visualquestiondecompositionmultimodal,
        title={Visual Question Decomposition on Multimodal Large Language Models},
        author={Haowei Zhang and Jianzhe Liu and Zhen Han and Shuo Chen and Bailan He and Volker Tresp and Zhiqiang Xu and Jindong Gu},
        year={2024},
        eprint={2409.19339},
        archivePrefix={arXiv},
        primaryClass={cs.CL},
        url={https://arxiv.org/abs/2409.19339},
    }

Model size: 25.5B params (Safetensors)

Tensor type: BF16

Base model: OpenGVLab/InternVL-Chat-V1-5 (freesky/InternVL-Chat-V1-5_ft_by_DecoVQA is fine-tuned from it)