Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
nisten
/
qwenv2-7b-inst-imatrix-gguf
like
3
GGUF
Inference Endpoints
imatrix
conversational
License:
apache-2.0
Model card
Files
Files and versions
Community
Deploy
Use this model
b7097b6
qwenv2-7b-inst-imatrix-gguf
1 contributor
History:
8 commits
nisten
calculated imatrix in 8bit, was jsut as good as f16 imatrix
b7097b6
verified
8 months ago
.gitattributes
2.31 kB
calculated imatrix in 8bit, was jsut as good as f16 imatrix
8 months ago
8bitimatrix.dat
4.54 MB
LFS
calculated imatrix in 8bit, was jsut as good as f16 imatrix
8 months ago
README.md
1.55 kB
Update README.md
8 months ago
qwen7bf16.gguf
15.2 GB
LFS
Upload 9 files
8 months ago
qwen7bq4kembeddingf16outputf16.gguf
6.11 GB
LFS
Rename qwen7bq4kembeddingbf16outputbf16.gguf to qwen7bq4kembeddingf16outputf16.gguf
8 months ago
qwen7bq4koutput8bit.gguf
4.82 GB
LFS
Upload 9 files
8 months ago
qwen7bq4xsembedding8output8.gguf
4.64 GB
LFS
Rename qwen7bq4xsembedding5bitkoutput8bit.gguf to qwen7bq4xsembedding8output8.gguf
8 months ago
qwen7bq4xsoutput6k.gguf
4.22 GB
LFS
Rename qwen7bq4xs.gguf to qwen7bq4xsoutput6k.gguf
8 months ago
qwen7bq4xsoutput8bit.gguf
4.35 GB
LFS
Upload 9 files
8 months ago
qwen7bq5km.gguf
5.58 GB
LFS
Upload 9 files
8 months ago
qwenq8v2.gguf
8.1 GB
LFS
Upload 9 files
8 months ago