more quantized versions?
#10
opened by Liangmingxin
I would also really value an AWQ quantisation here as well.
I can do GPTQ and GGUF if they are not already in progress (I've never done AWQ, to be honest).
@MaziyarPanahi , I would be very appreciative of the GPTQ version. Or maybe just instructions on how to do it.
@ZanMax I use the official AutoGPTQ script, which TheBloke also uses. It just requires a lot of GPU vRAM. I will start my script for this model.
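Since @ZanMax asked for instructions, here is a minimal sketch of the standard AutoGPTQ Python flow (the library behind the script mentioned above). This is a sketch, not the exact script used here: the model ID, output directory, calibration text, and the helper function names are placeholders, and a real run on a 70B model needs a large GPU and hundreds of calibration samples.

```python
def quantize_gptq(model_id: str, out_dir: str) -> None:
    """Sketch of 4-bit GPTQ quantization with the AutoGPTQ library.

    model_id and out_dir are placeholders; a real 70B run needs
    tens of GB of vRAM and several hours.
    """
    # Heavy imports kept inside the function so the module loads
    # without a GPU or the auto-gptq package installed.
    from transformers import AutoTokenizer
    from auto_gptq import AutoGPTQForCausalLM, BaseQuantizeConfig

    quantize_config = BaseQuantizeConfig(
        bits=4,          # 4-bit weights
        group_size=128,  # common default; smaller groups = better accuracy, larger files
        desc_act=False,  # act-order off for faster inference kernels
    )

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    # A real run should use a few hundred calibration samples
    # (e.g. drawn from C4 or WikiText), not a single sentence.
    examples = [tokenizer("GPTQ calibrates the quantizer on sample text like this.")]

    model = AutoGPTQForCausalLM.from_pretrained(model_id, quantize_config)
    model.quantize(examples)
    model.save_quantized(out_dir, use_safetensors=True)
    tokenizer.save_pretrained(out_dir)


def gptq_size_gb(n_params: float, bits: int = 4) -> float:
    """Rough on-disk size of the quantized weights: bits per parameter."""
    return n_params * bits / 8 / 1e9


if __name__ == "__main__":
    # A 70B model at 4-bit is roughly 35 GB of weights on disk.
    print(f"~{gptq_size_gb(70e9):.0f} GB for a 70B model at 4-bit")
    # quantize_gptq("152334H/miqu-1-70b-sf", "miqu-1-70b-sf-GPTQ")  # needs a big GPU
```

GPTQ quantizes layer by layer, which is why the peak vRAM needed is far below the full fp16 model size, and why a calibration set is required at all: it measures activations to choose the per-group quantization parameters.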
@ZanMax
I made the GPTQ version of this model: MaziyarPanahi/miqu-1-70b-sf-GPTQ
PS: it requires 20-22 GB of vRAM on an A100 and takes around 4 hours to finish the GPTQ quantization.