qwenv2-7b-inst-imatrix-gguf / qwen7bv2inst_q4km_output8bit.gguf

Commit History

very good quant for speed/perplexity, embedding is at q4k
6c5e613
verified

nisten commited on