MCG-NJU
/

p-MoD-LLaVA-v1.5-7B

Image-Text-to-Text

pmod_llava_llama

Model card Files Files and versions Community

p-MoD: Building Mixture-of-Depths MLLMs via Progressive Ratio Decay

This is the official model checkpoint of p-MoD: Building Mixture-of-Depths MLLMs via Progressive Ratio Decay. Please refer to this repository for our code.

Model Description

This model is pretrained on LCS-558K image caption data, and instruction-tuned on llava-v1_5-mix-665k.

Citation

If you find our work helpful for your research and applications, please cite our paper:

@article{zhang2024pmod,
  title={p-MoD: Building Mixture-of-Depths MLLMs via Progressive Ratio Decay},
  author={Zhang, Jun and Meng, Desen and Qi, Ji and Huang, Zhenpeng and Wu, Tao and Wang, Limin},
  journal={arXiv preprint arXiv:2412.04449},
  year={2024}
}

License

Llama 2 is licensed under the LLAMA 2 Community License, Copyright (c) Meta Platforms, Inc. All Rights Reserved.

Downloads last month: 25

Safetensors

Model size

7.06B params

Tensor type

BF16

·

Inference API

Image-Text-to-Text

Unable to determine this model's library. Check the docs .

Model tree for MCG-NJU/p-MoD-LLaVA-v1.5-7B

Base model

lmsys/vicuna-7b-v1.5

Finetuned

(45)

this model