4K_0 gone and 4K_XS?

#2
by supercharge19 - opened

What is the new quant, and how does it compare with the other 4K quants? And why did 4K0 disappear? K0 got a speed boost a couple of days ago: a new release made it faster to some extent, while generation quality remains the same (sorry, I forgot the PR number).

Thank you, I found the PR for this: https://github.com/ggerganov/llama.cpp/pull/5060

Inferring from the discussion there, I think this is the new K0, i.e. it sits between Q3 and Q4.

Hi @supercharge19

There is a good discussion here: https://huggingface.co/MaziyarPanahi/Venomia-1.1-m7-Mistral-7B-Instruct-v0.2-slerp-GGUF/discussions/1#65e7e609a73ffb80b47fd7fd

I kept my list and just added one of the new _XS quants at 4 bits for testing. Hopefully I can add more of the new ones once I know which are useful to the community.
