Quantisation question?

#5
by pjw000 - opened

Using this model, should I be able to create a GPTQ 4-bit version? My naive attempts using the official instruct model produce the following error, which I have no idea how to deal with:

ValueError: Block pattern could not be match. Pass block_name_to_quantize argument in quantize_model

Specifically: I have no idea what the block name might be!
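In case it helps anyone hitting the same error: `block_name_to_quantize` is the dotted attribute path to the model's stack of repeated transformer blocks (for many decoder-only checkpoints that path is something like `model.layers`, but that is an assumption — check your own model). One way to find a candidate is to scan the module tree for the largest list of repeated sub-modules. A minimal pure-Python sketch, with a stub class standing in for the real model (on a real checkpoint you would walk the loaded `nn.Module` the same way):

```python
# Sketch: find candidate values for `block_name_to_quantize` by locating
# list-valued attributes in a model's attribute tree. The stub below
# mimics a typical decoder-only layout (hypothetical -- inspect your own
# model, e.g. via print(model), to confirm the real path).

def find_block_lists(obj, path="", depth=0, max_depth=4):
    """Return (dotted_path, length) for every non-empty list attribute."""
    results = []
    if depth > max_depth:
        return results
    for name in vars(obj):
        child = getattr(obj, name)
        child_path = f"{path}.{name}" if path else name
        if isinstance(child, list) and child:
            results.append((child_path, len(child)))
        elif hasattr(child, "__dict__"):
            results.extend(find_block_lists(child, child_path, depth + 1, max_depth))
    return results

# --- stub mimicking a decoder-only model (hypothetical layout) ---
class Block:
    pass

class Inner:
    def __init__(self):
        self.layers = [Block() for _ in range(4)]  # the repeated decoder blocks

class Model:
    def __init__(self):
        self.model = Inner()

candidates = find_block_lists(Model())
# The longest list of identical sub-modules is usually the block stack.
best = max(candidates, key=lambda t: t[1])[0]
print(best)  # -> model.layers
```

The longest repeated-module list is usually the right answer, since GPTQ quantizes the model one transformer block at a time.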

There are already people trying to get a GPTQ version. I'd suggest having a look at this GitHub discussion: https://github.com/AutoGPTQ/AutoGPTQ/issues/621

Thank you. That's a VERY helpful link! Much appreciated.

pjw000 changed discussion status to closed
