metadata
inference: false
Model Card
Bunny is a family of lightweight multimodal models.
Bunny-phi-1.5-eva-lora leverages Phi-1.5 as the language model backbone and EVA-CLIP as the vision encoder. It is pretrained on LAION-2M and finetuned on Bunny-695K.
More details about this model can be found in GitHub.