---
tags:
- transformers
- xlm-roberta
- eva02
- clip
library_name: transformers
license: cc-by-nc-4.0
---
# Jina CLIP
Core implementation of Jina CLIP. The model uses:

* the EVA-02 architecture for the vision tower
* the Jina XLM-RoBERTa with Flash Attention model for the text tower
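As a CLIP-style dual encoder, the two towers map images and texts into a shared embedding space, where relevance is scored by cosine similarity. A minimal sketch of that scoring step, using random stand-in vectors rather than actual tower outputs:

```python
import numpy as np

rng = np.random.default_rng(0)

# Stand-in embeddings; in practice these come from the vision and text towers.
image_emb = rng.standard_normal((4, 768))  # batch of 4 image embeddings
text_emb = rng.standard_normal((4, 768))   # batch of 4 text embeddings

# L2-normalize so a plain dot product equals cosine similarity.
image_emb /= np.linalg.norm(image_emb, axis=-1, keepdims=True)
text_emb /= np.linalg.norm(text_emb, axis=-1, keepdims=True)

# Pairwise text-image similarity matrix.
sim = text_emb @ image_emb.T
print(sim.shape)  # (4, 4)
```

The embedding dimension (768) here is illustrative; the real value depends on the model configuration.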
## Models that use this implementation
## Requirements
To use the Jina CLIP source code, the following packages are required:

* `torch`
* `timm`
* `transformers`
* `einops`
* `xformers` to use x-attention
* `flash-attn` to use flash attention
* `apex` to use fused layer normalization
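A quick way to verify that the core requirements are available in the current environment (a small helper sketch, not part of the repository):

```python
from importlib.util import find_spec

# Core requirements; xformers, flash-attn, and apex are optional extras.
required = ["torch", "timm", "transformers", "einops"]
missing = [pkg for pkg in required if find_spec(pkg) is None]

if missing:
    print("missing packages:", ", ".join(missing))
else:
    print("all core requirements available")
```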