
Request for quantized version

#2
by sudhir2016

A quantized version of the model that can be used for inference in a free-tier Google Colab notebook would be nice.

MaLA-LM org

Will you be able to use HF's quantization integrations, such as bitsandbytes (https://huggingface.co/docs/transformers/v4.35.0/main_classes/quantization#bitsandbytes-integration)?

Yes, please. Will it work with `load_in_4bit=True`?
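
For reference, a minimal sketch of what 4-bit loading via the bitsandbytes integration might look like on a free-tier Colab GPU. The model id below is a placeholder (this thread doesn't name the exact repo), and it assumes `transformers`, `accelerate`, and `bitsandbytes` are installed:

```python
# Minimal sketch: 4-bit bitsandbytes loading in transformers.
# Assumptions: placeholder model id; a CUDA GPU (e.g. Colab T4) is available.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

model_id = "MaLA-LM/your-model-here"  # placeholder: substitute the actual repo id

# NF4 4-bit quantization; fp16 compute is the safe choice on a T4.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.float16,
)

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    quantization_config=bnb_config,
    device_map="auto",  # place layers on the available GPU automatically
)

inputs = tokenizer("Hello, world!", return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```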
