Model broken?

#1
by wcde - opened

I tried several quants, all of them produce random set of characters. EXL2 works without problems.

@wcde Have you double checked your prompt format and more importantly the rope scale?

this one needs the rope scale set to 4, i'm not sure why llama.cpp seems to have ignored that during conversion.. it produces pure gibberish if the rope scale isn't set properly

Sign up or log in to comment