qwenv2-7b-inst-imatrix-gguf / qwen7bv2inst_q4km_embedding4k_output8bit.gguf

Commit History

very good quant for speed/perplexity, embedding is at q4k
3107dcd
verified

nisten commited on