Text Generation
Transformers
Safetensors
imp
custom_code

😈 Imp

Introduction

Based on Imp-v1.5-3B-phi2, we reduce the resolution of the input image from 384 to 196, and retrain the model using the same settings to obtain Imp-v1.5-3B-196

License

This project is licensed under the Apache License 2.0 - see the LICENSE file for details.

Citation

If you use our model or refer our work in your studies, please cite:

@article{imp2024,
  title={Imp: Highly Capable Large Multimodal Models for Mobile Devices},
  author={Shao, Zhenwei and Yu, Zhou and Yu, Jun and Ouyang, Xuecheng and Zheng, Lihao and Gai, Zhenbiao and Wang, Mingyang and Ding, Jiajun},
  journal={arXiv preprint arXiv:2405.12107},
  year={2024}
}
Downloads last month
10
Safetensors
Model size
3.19B params
Tensor type
FP16
Β·
Inference Examples
Inference API (serverless) does not yet support model repos that contain custom code.

Datasets used to train MILVLG/Imp-v1.5-3B-196

Collection including MILVLG/Imp-v1.5-3B-196