Probably best speed to perplexity ratio of any 7b gguf model so far 0e76852 verified nisten commited on Jun 16
calculated imatrix in 8bit, was jsut as good as f16 imatrix b7097b6 verified nisten commited on Jun 16
Rename qwen7bq4xsembedding5bitkoutput8bit.gguf to qwen7bq4xsembedding8output8.gguf ee4c789 verified nisten commited on Jun 16
Rename qwen7bq4kembeddingbf16outputbf16.gguf to qwen7bq4kembeddingf16outputf16.gguf d9150dc verified nisten commited on Jun 16