metadata
license: apache-2.0
pipeline_tag: image-to-text
Libra-Chat
Libra: Building Decoupled Vision System on Large Language Models
This model was further finetuned with instructions based on Libra-Base for multi-modal chat.
!!! NOTE !!!
In addition to the pretrained weights in this repo, please download the pretrained CLIP model in huggingface and merge it into the path, as:
libra-chat/
βββ ...
βββ openai-clip-vit-large-patch14-336/
βββ ...
The CLIP model can be downloaded here.