nisten/qwenv2-7b-inst-imatrix-gguf


1 contributor

History: 8 commits

calculated imatrix in 8bit, was just as good as f16 imatrix

b7097b6 verified 8 months ago

| File | Size | Storage | Last commit | Updated |
|---|---|---|---|---|
| .gitattributes | 2.31 kB | | calculated imatrix in 8bit, was just as good as f16 imatrix | 8 months ago |
| 8bitimatrix.dat | 4.54 MB | LFS | calculated imatrix in 8bit, was just as good as f16 imatrix | 8 months ago |
| README.md | 1.55 kB | | Update README.md | 8 months ago |
| qwen7bf16.gguf | 15.2 GB | LFS | Upload 9 files | 8 months ago |
| qwen7bq4kembeddingf16outputf16.gguf | 6.11 GB | LFS | Rename qwen7bq4kembeddingbf16outputbf16.gguf to qwen7bq4kembeddingf16outputf16.gguf | 8 months ago |
| qwen7bq4koutput8bit.gguf | 4.82 GB | LFS | Upload 9 files | 8 months ago |
| qwen7bq4xsembedding8output8.gguf | 4.64 GB | LFS | Rename qwen7bq4xsembedding5bitkoutput8bit.gguf to qwen7bq4xsembedding8output8.gguf | 8 months ago |
| qwen7bq4xsoutput6k.gguf | 4.22 GB | LFS | Rename qwen7bq4xs.gguf to qwen7bq4xsoutput6k.gguf | 8 months ago |
| qwen7bq4xsoutput8bit.gguf | 4.35 GB | LFS | Upload 9 files | 8 months ago |
| qwen7bq5km.gguf | 5.58 GB | LFS | Upload 9 files | 8 months ago |
| qwenq8v2.gguf | 8.1 GB | LFS | Upload 9 files | 8 months ago |