Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
nisten
/
qwenv2-7b-inst-imatrix-gguf
like
3
GGUF
Inference Endpoints
conversational
License:
apache-2.0
Model card
Files
Files and versions
Community
Deploy
Use this model
9cbd6f2
qwenv2-7b-inst-imatrix-gguf
/
qwen7bv2inst_q4km_output8bit.gguf
Commit History
very good quant for speed/perplexity, embedding is at q4k
6c5e613
verified
nisten
commited on
Jun 16