fp32 for everything?

#8
by neph1 - opened

Considering the size of the safetensors file, it seems that both models are saved in fp32? I downloaded the previous version, and then only the vision model was fp32. Is this an unfortunate consequence of merging the models, or just an oversight?

I guess an eventual quantization would solve the issue either way.
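The dtype inference here is just arithmetic: a checkpoint stored in fp32 uses 4 bytes per parameter, while fp16 uses 2, so the file size roughly doubles. A minimal sketch of that back-of-the-envelope check (the parameter count below is a placeholder, not moondream's actual size):

```python
# Hypothetical parameter count -- substitute the real model's count.
n_params = 1_870_000_000

# fp32 stores 4 bytes per weight, fp16 stores 2, so an fp32
# safetensors file is roughly twice the size of an fp16 one.
size_fp32_gb = n_params * 4 / 1e9
size_fp16_gb = n_params * 2 / 1e9

print(f"fp32: ~{size_fp32_gb:.2f} GB")
print(f"fp16: ~{size_fp16_gb:.2f} GB")
```

If the file on disk is close to the fp32 estimate, the weights were almost certainly saved in 32-bit precision.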

Owner

Oversight on my part. They were trained in 16-bit precision, so there's no reason for the weights to be 32-bit here. Will try to fix soon.
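The fix itself amounts to casting the state dict to fp16 before re-saving. A minimal sketch, assuming a PyTorch state dict (the keys and shapes below are placeholders, not moondream's actual layout):

```python
import torch

# Placeholder fp32 state dict standing in for the merged checkpoint.
state_dict = {
    "vision.proj.weight": torch.randn(4, 4, dtype=torch.float32),
    "text.embed.weight": torch.randn(8, 4, dtype=torch.float32),
}

# Cast every tensor to half precision before saving.
fp16_state_dict = {k: v.half() for k, v in state_dict.items()}

for name, tensor in fp16_state_dict.items():
    print(name, tensor.dtype)  # each tensor is now torch.float16
```

The downcast halves the file size; since the weights were trained in 16-bit precision, no accuracy is lost relative to the training run.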

Owner

Updated to fp16!

vikhyatk changed discussion status to closed
