Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
QuantFactory
/
Sparse-Llama-3.1-8B-2of4-GGUF
like
4
Follow
Quant Factory
293
Text Generation
GGUF
vllm
sparsity
Inference Endpoints
arxiv:
2301.00774
arxiv:
2310.06927
License:
llama3.1
Model card
Files
Files and versions
Community
Deploy
Use this model
45cd383
Sparse-Llama-3.1-8B-2of4-GGUF
1 contributor
History:
19 commits
aashish1904
Upload Sparse-Llama-3.1-8B-2of4.Q4_0_8_8.gguf with huggingface_hub
45cd383
verified
27 days ago
.gitattributes
Safe
2.75 kB
Upload Sparse-Llama-3.1-8B-2of4.Q4_0_8_8.gguf with huggingface_hub
27 days ago
README.md
Safe
6.06 kB
Upload README.md with huggingface_hub
27 days ago
Sparse-Llama-3.1-8B-2of4.Q2_K.gguf
Safe
3.18 GB
LFS
Upload Sparse-Llama-3.1-8B-2of4.Q2_K.gguf with huggingface_hub
27 days ago
Sparse-Llama-3.1-8B-2of4.Q3_K_L.gguf
Safe
4.32 GB
LFS
Upload Sparse-Llama-3.1-8B-2of4.Q3_K_L.gguf with huggingface_hub
27 days ago
Sparse-Llama-3.1-8B-2of4.Q3_K_M.gguf
Safe
4.02 GB
LFS
Upload Sparse-Llama-3.1-8B-2of4.Q3_K_M.gguf with huggingface_hub
27 days ago
Sparse-Llama-3.1-8B-2of4.Q3_K_S.gguf
Safe
3.66 GB
LFS
Upload Sparse-Llama-3.1-8B-2of4.Q3_K_S.gguf with huggingface_hub
27 days ago
Sparse-Llama-3.1-8B-2of4.Q4_0.gguf
Safe
4.66 GB
LFS
Upload Sparse-Llama-3.1-8B-2of4.Q4_0.gguf with huggingface_hub
27 days ago
Sparse-Llama-3.1-8B-2of4.Q4_0_4_4.gguf
Safe
4.66 GB
LFS
Upload Sparse-Llama-3.1-8B-2of4.Q4_0_4_4.gguf with huggingface_hub
27 days ago
Sparse-Llama-3.1-8B-2of4.Q4_0_4_8.gguf
Safe
4.66 GB
LFS
Upload Sparse-Llama-3.1-8B-2of4.Q4_0_4_8.gguf with huggingface_hub
27 days ago
Sparse-Llama-3.1-8B-2of4.Q4_0_8_8.gguf
Safe
4.66 GB
LFS
Upload Sparse-Llama-3.1-8B-2of4.Q4_0_8_8.gguf with huggingface_hub
27 days ago
Sparse-Llama-3.1-8B-2of4.Q4_1.gguf
Safe
5.13 GB
LFS
Upload Sparse-Llama-3.1-8B-2of4.Q4_1.gguf with huggingface_hub
27 days ago
Sparse-Llama-3.1-8B-2of4.Q4_K_M.gguf
Safe
4.92 GB
LFS
Upload Sparse-Llama-3.1-8B-2of4.Q4_K_M.gguf with huggingface_hub
27 days ago
Sparse-Llama-3.1-8B-2of4.Q4_K_S.gguf
Safe
4.69 GB
LFS
Upload Sparse-Llama-3.1-8B-2of4.Q4_K_S.gguf with huggingface_hub
27 days ago
Sparse-Llama-3.1-8B-2of4.Q5_0.gguf
Safe
5.6 GB
LFS
Upload Sparse-Llama-3.1-8B-2of4.Q5_0.gguf with huggingface_hub
27 days ago
Sparse-Llama-3.1-8B-2of4.Q5_1.gguf
Safe
6.07 GB
LFS
Upload Sparse-Llama-3.1-8B-2of4.Q5_1.gguf with huggingface_hub
27 days ago
Sparse-Llama-3.1-8B-2of4.Q5_K_M.gguf
Safe
5.73 GB
LFS
Upload Sparse-Llama-3.1-8B-2of4.Q5_K_M.gguf with huggingface_hub
27 days ago
Sparse-Llama-3.1-8B-2of4.Q5_K_S.gguf
Safe
5.6 GB
LFS
Upload Sparse-Llama-3.1-8B-2of4.Q5_K_S.gguf with huggingface_hub
27 days ago
Sparse-Llama-3.1-8B-2of4.Q6_K.gguf
Safe
6.6 GB
LFS
Upload Sparse-Llama-3.1-8B-2of4.Q6_K.gguf with huggingface_hub
27 days ago
Sparse-Llama-3.1-8B-2of4.Q8_0.gguf
Safe
8.54 GB
LFS
Upload Sparse-Llama-3.1-8B-2of4.Q8_0.gguf with huggingface_hub
27 days ago