Why is the text_encoder model in the OpenCLIP (CLIP ViT-H) library 3.94 GB, while the size in this library is 1.36 GB?
#93
by MetaInsight
The model card states that OpenCLIP ViT-H is used, but the file size is different. Does anyone know why?
OpenCLIP: https://huggingface.co/laion/CLIP-ViT-H-14-laion2B-s32B-b79K/tree/main
Yeah, that's a big question. I couldn't project the encoded hidden states, because there are no projection weights in this repo.
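One plausible explanation for the size gap (a sketch, not confirmed by the model card): the full OpenCLIP ViT-H-14 checkpoint stores both the vision tower and the text tower (plus the projections) in fp32, while a text_encoder-only export keeps just the text tower. The parameter counts below are approximate ballpark figures for CLIP ViT-H-14, not values read from the actual checkpoints; the byte math shows they roughly line up with the 3.94 GB and 1.36 GB files.

```python
# Sketch: estimate checkpoint file size from parameter count, assuming
# fp32 storage (4 bytes per weight) and negligible serialization overhead.
# Parameter counts are approximate assumptions for CLIP ViT-H-14.

def fp32_gb(num_params: int) -> float:
    """File size in GB (10^9 bytes) of num_params float32 weights."""
    return num_params * 4 / 1e9

# Approx. text tower alone (hidden size 1024, 23 layers, 49408-token vocab).
text_tower_params = 341_000_000
# Approx. full CLIP: text tower + ViT-H vision tower + projection heads.
full_clip_params = 986_000_000

print(f"text encoder only: ~{fp32_gb(text_tower_params):.2f} GB")
print(f"full CLIP:         ~{fp32_gb(full_clip_params):.2f} GB")
```

Under these assumptions the text-tower-only export comes out near 1.36 GB and the full checkpoint near 3.94 GB, matching the two file sizes in the question; the missing `text_projection` weights mentioned above would also be consistent with a stripped-down, text-tower-only export.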