How is it quantized, and why does the configuration show float32?

by Sohaibsoussi - opened Jul 13

Hello 👋
I'm curious how you quantized your model and pushed it to the Hub. I tried multiple times and it failed 😔
I also saw that the configuration lists the dtype as float32. Does that mean HF does not accept the qint8 format?
