Here is a Google Colab with 1.1 support

#3
by eucdee - opened

Very cool!

@eucdee thanks for sharing. You can now try the triton branch of GPTQ-for-LLaMa for (hopefully) better performance, as the code has been fixed (remove "-b cuda").

Oh really? Qwopqwop's latest commits mean it works in text-generation-webui again? If so, that's great news.

@TheBloke , there is a little bug still: https://github.com/oobabooga/text-generation-webui/issues/1343#issuecomment-1513070072

So you need to edit text-generation-webui/modules/GPTQ_loader.py, replacing the old import with the new one:

#from modelutils import find_layers
from utils import find_layers
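
The edit above can be applied from the command line as well. A minimal sketch, assuming a default checkout layout where the file lives at text-generation-webui/modules/GPTQ_loader.py:

```shell
# Swap the old import for the new one in place, keeping a .bak backup.
# The path below is an assumption based on a default text-generation-webui checkout.
sed -i.bak 's/^from modelutils import find_layers$/from utils import find_layers/' \
    text-generation-webui/modules/GPTQ_loader.py
```

The `-i.bak` form works on both GNU and BSD sed, so the same command runs on Linux and macOS.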

Tested with:

  • oobabooga/text-generation-webui commit 9d9ae6293833ce31bbb5ed5d9a04b033d1e3896d
  • qwopqwop200/GPTQ-for-LLaMa commit d89cdcd8b53f61346290a28d326816af6a028434

Great, thanks for the details!

@eucdee I added a link to your Colab in the README. Thanks for providing it!

TheBloke changed discussion status to closed
