Model Description

This repo contains ONNX exports of the corresponding ViT-based multilingual CLIP model from OpenCLIP. See the OpenCLIP repo for more info. The visual and textual encoders are exported as separate models, one for generating image embeddings and one for generating text embeddings.

This repo is specifically intended for use with Immich, a self-hosted photo library.
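
Because the visual and textual encoders are separate models, a typical search flow embeds images and text queries independently and then compares the resulting vectors. The sketch below illustrates only that comparison step using cosine similarity, which is the usual choice for CLIP-style embeddings. The function names (`cosine_similarity`, `rank_images`), the file names, and the tiny 3-dimensional vectors are hypothetical placeholders, not real outputs of these models and not Immich's actual implementation.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def rank_images(text_embedding, image_embeddings):
    """Return (name, score) pairs sorted from most to least similar."""
    scored = [
        (name, cosine_similarity(text_embedding, embedding))
        for name, embedding in image_embeddings.items()
    ]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Placeholder vectors standing in for encoder outputs (assumption:
# real embeddings would come from the textual and visual ONNX models).
text_embedding = [0.9, 0.1, 0.0]
image_embeddings = {
    "beach.jpg": [0.8, 0.2, 0.1],
    "receipt.jpg": [0.0, 0.1, 0.9],
}

ranking = rank_images(text_embedding, image_embeddings)
for name, score in ranking:
    print(f"{name}: {score:.3f}")
```

In practice the image embeddings would be computed once and stored, so that only the text query needs to be embedded at search time.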


Collection including immich-app/XLM-Roberta-Large-ViT-H-14__frozen_laion5b_s13b_b90k