GGUF
Not-For-All-Audiences

4Q K_S request

#1
by mpt369 - opened

Hi, I'm curious about your model except my computer cannot run the 4q_k_m version without off-loading it into the RAM which is really slow, is a 4Q K_S version possible?

You could use either the iMatrix iQ4nl or iQ4xs they also provided instead of the Q4_K_S.

https://huggingface.co/MarsupialAI/Llama-3SOME-8B-v1-BETA_iMatrix_GGUF/tree/main

Thank you, I'll try that.

mpt369 changed discussion status to closed

Sign up or log in to comment