ask #12
opened by ReD2401

Hello TheBloke, I hope you are well.

Thank you for all your effort! \o/

I'm happy to take a look. There are some complications with GPT-J models: llama.cpp can't load them, and the latest and best 4-bit CPU quantisation code doesn't work with them either. It would be possible to use them in GPT4ALL-Chat, though, or there's a CLI version that also supports GPT-J on CPU.
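
If it helps, here's a rough sketch of what CPU inference via the gpt4all Python bindings looks like. The model filename is just an example GPT-J-based GGML file from the GPT4All catalogue, not the model from this thread:

```python
# Minimal sketch using the gpt4all Python bindings (pip install gpt4all).
# The filename is an example GPT-J-based GGML model from the GPT4All
# catalogue, not this thread's model -- swap in whatever GGML file you have.
from gpt4all import GPT4All

model = GPT4All("ggml-gpt4all-j-v1.3-groovy.bin")  # GPT-J architecture, runs on CPU

with model.chat_session():
    reply = model.generate("What does 4-bit quantisation do?", max_tokens=128)
    print(reply)
```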

GPTQ 4-bit for GPU should be possible, though it's not as well supported.
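
If you want to try the GPU route yourself, here's a rough sketch of 4-bit GPTQ quantisation through the GPTQConfig integration in transformers (it needs optimum and auto-gptq installed, and the EleutherAI/gpt-j-6b model ID is just a stand-in):

```python
# Rough sketch of 4-bit GPTQ quantisation via transformers (needs optimum + auto-gptq).
# "EleutherAI/gpt-j-6b" is a stand-in model ID -- point this at the repo you want.
from transformers import AutoModelForCausalLM, AutoTokenizer, GPTQConfig

model_id = "EleutherAI/gpt-j-6b"
tokenizer = AutoTokenizer.from_pretrained(model_id)

# Quantise to 4 bits, calibrating on samples from the C4 dataset.
quant_config = GPTQConfig(bits=4, dataset="c4", tokenizer=tokenizer)

model = AutoModelForCausalLM.from_pretrained(
    model_id,
    device_map="auto",                 # quantisation itself runs on the GPU
    quantization_config=quant_config,
)

# Save the quantised weights for later GPU inference.
model.save_pretrained("gpt-j-6b-GPTQ-4bit")
tokenizer.save_pretrained("gpt-j-6b-GPTQ-4bit")
```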

I will give it a go and let you know!

Thank you so much, I really appreciate it!

Why did you edit the model out of your comment? Do you not want me to look at it any more? Or did someone else do it?
