Libra
Collection
The official repo for the ICML2024 paper: Libra: Building Decoupled Vision System on Large Language Models
β’
3 items
β’
Updated
Libra: Building Decoupled Vision System on Large Language Models
This model was trained on image-text pairs for basic multi-modal understanding ability.
In addition to the pretrained weights in this repo, please download the pretrained CLIP model in huggingface and merge it into the path, as:
libra-base/
βββ ...
βββ openai-clip-vit-large-patch14-336/
βββ ...
The CLIP model can be downloaded here.