q8 quantization

by darxkies - opened Nov 22, 2023

Discussion

darxkies

Nov 22, 2023

Could you also please upload the q8 quantization?

tastypear

Owner Nov 25, 2023

I tested this model and thought the quality of its responses was not good. It's not clear to me that the problem comes from the original model, my testing method, or the quantification. So I deleted the original files and decided to hold off on releasing any more versions.

If you can provide generated samples and test methods to prove that there are no problems with these quantized models, I will be happy to regenerate the q8 version.

darxkies

Nov 26, 2023

I've noticed the same and I was hoping that with q8 it would get more "stable".

darxkies changed discussion status to closed Nov 26, 2023

tastypear

Owner Nov 27, 2023

It's a pity that q8 is not good either. I just gave up. ┑(￣ω ￣)┍
Here are some other English/Chinese uncensored Llama models if you are interested: MiniChat-3B / CausalLM 7B-DPO-alpha / CausalLM 14-DPO-alpha.

darxkies

Nov 27, 2023

Thank you very much.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment