Model Card
Bunny is a family of lightweight multimodal models.
Bunny-qwen1.5-1.8b-siglip-lora leverages Qwen1.5-1.8B as the language model backbone and SigLIP as the vision encoder. It is pretrained on LAION-2M and finetuned on Bunny-695K.
More details about this model can be found on GitHub.
License
This project uses certain datasets and checkpoints that are subject to their respective original licenses. Users must comply with all terms and conditions of these original licenses. The content of this project itself is licensed under the Apache License 2.0.