Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
RichardErkhov
/
wang7776_-_vicuna-7b-v1.3-attention-sparsity-20-gguf
like
0
GGUF
Inference Endpoints
arxiv:
2306.11695
arxiv:
2302.13971
arxiv:
2306.05685
Model card
Files
Files and versions
Community
Deploy
Use this model
main
wang7776_-_vicuna-7b-v1.3-attention-sparsity-20-gguf
1 contributor
History:
24 commits
RichardErkhov
uploaded readme
4a1c79b
verified
3 months ago
.gitattributes
3.37 kB
uploaded model
3 months ago
README.md
7.46 kB
uploaded readme
3 months ago
vicuna-7b-v1.3-attention-sparsity-20.IQ3_M.gguf
3.11 GB
LFS
uploaded model
3 months ago
vicuna-7b-v1.3-attention-sparsity-20.IQ3_S.gguf
2.95 GB
LFS
uploaded model
3 months ago
vicuna-7b-v1.3-attention-sparsity-20.IQ3_XS.gguf
2.8 GB
LFS
uploaded model
3 months ago
vicuna-7b-v1.3-attention-sparsity-20.IQ4_NL.gguf
3.85 GB
LFS
uploaded model
3 months ago
vicuna-7b-v1.3-attention-sparsity-20.IQ4_XS.gguf
3.65 GB
LFS
uploaded model
3 months ago
vicuna-7b-v1.3-attention-sparsity-20.Q2_K.gguf
2.53 GB
LFS
uploaded model
3 months ago
vicuna-7b-v1.3-attention-sparsity-20.Q3_K.gguf
3.3 GB
LFS
uploaded model
3 months ago
vicuna-7b-v1.3-attention-sparsity-20.Q3_K_L.gguf
3.6 GB
LFS
uploaded model
3 months ago
vicuna-7b-v1.3-attention-sparsity-20.Q3_K_M.gguf
3.3 GB
LFS
uploaded model
3 months ago
vicuna-7b-v1.3-attention-sparsity-20.Q3_K_S.gguf
2.95 GB
LFS
uploaded model
3 months ago
vicuna-7b-v1.3-attention-sparsity-20.Q4_0.gguf
3.83 GB
LFS
uploaded model
3 months ago
vicuna-7b-v1.3-attention-sparsity-20.Q4_1.gguf
4.24 GB
LFS
uploaded model
3 months ago
vicuna-7b-v1.3-attention-sparsity-20.Q4_K.gguf
4.08 GB
LFS
uploaded model
3 months ago
vicuna-7b-v1.3-attention-sparsity-20.Q4_K_M.gguf
4.08 GB
LFS
uploaded model
3 months ago
vicuna-7b-v1.3-attention-sparsity-20.Q4_K_S.gguf
3.86 GB
LFS
uploaded model
3 months ago
vicuna-7b-v1.3-attention-sparsity-20.Q5_0.gguf
4.65 GB
LFS
uploaded model
3 months ago
vicuna-7b-v1.3-attention-sparsity-20.Q5_1.gguf
5.06 GB
LFS
uploaded model
3 months ago
vicuna-7b-v1.3-attention-sparsity-20.Q5_K.gguf
4.78 GB
LFS
uploaded model
3 months ago
vicuna-7b-v1.3-attention-sparsity-20.Q5_K_M.gguf
4.78 GB
LFS
uploaded model
3 months ago
vicuna-7b-v1.3-attention-sparsity-20.Q5_K_S.gguf
4.65 GB
LFS
uploaded model
3 months ago
vicuna-7b-v1.3-attention-sparsity-20.Q6_K.gguf
5.53 GB
LFS
uploaded model
3 months ago
vicuna-7b-v1.3-attention-sparsity-20.Q8_0.gguf
7.16 GB
LFS
uploaded model
3 months ago