Maybe need requant and IQ3_S models?

by Cran-May - opened May 1

May 1

as title.

May 1

IQ3_S is just fit for 4GB VRAM devices running 8B models.(IQ3_M is best for 7B models.)

Owner May 1

I'd need to try to redo these quants in the latest llamacpp and if do I'll include the IQ3_S.

Owner May 2

•

These will be reuploaded with the new llamacpp version.

Lewdiculous changed discussion status to closed May 2

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment