nisten/qwenv2-7b-inst-imatrix-gguf

qwenv2-7b-inst-imatrix-gguf / qwen7bv2inst_q4km_output8bit.gguf

Commit History

very good quant for speed/perplexity, embedding is at q4k

6c5e613
verified

nisten committed on Jun 16