QuantFactory/Sparse-Llama-3.1-8B-2of4-GGUF
Tags: Text Generation, GGUF, vllm, sparsity, Inference Endpoints, arxiv:2301.00774, arxiv:2310.06927
License: llama3.1
1 contributor
History: 18 commits
Latest commit: 0a0605b (verified) by aashish1904, 23 days ago: "Upload Sparse-Llama-3.1-8B-2of4.Q4_0_4_8.gguf with huggingface_hub"
| File | Size | LFS | Last commit message | Updated |
| --- | --- | --- | --- | --- |
| .gitattributes | 2.68 kB | no | Upload Sparse-Llama-3.1-8B-2of4.Q4_0_4_8.gguf with huggingface_hub | 23 days ago |
| README.md | 6.06 kB | no | Upload README.md with huggingface_hub | 23 days ago |
| Sparse-Llama-3.1-8B-2of4.Q2_K.gguf | 3.18 GB | yes | Upload Sparse-Llama-3.1-8B-2of4.Q2_K.gguf with huggingface_hub | 23 days ago |
| Sparse-Llama-3.1-8B-2of4.Q3_K_L.gguf | 4.32 GB | yes | Upload Sparse-Llama-3.1-8B-2of4.Q3_K_L.gguf with huggingface_hub | 23 days ago |
| Sparse-Llama-3.1-8B-2of4.Q3_K_M.gguf | 4.02 GB | yes | Upload Sparse-Llama-3.1-8B-2of4.Q3_K_M.gguf with huggingface_hub | 23 days ago |
| Sparse-Llama-3.1-8B-2of4.Q3_K_S.gguf | 3.66 GB | yes | Upload Sparse-Llama-3.1-8B-2of4.Q3_K_S.gguf with huggingface_hub | 23 days ago |
| Sparse-Llama-3.1-8B-2of4.Q4_0.gguf | 4.66 GB | yes | Upload Sparse-Llama-3.1-8B-2of4.Q4_0.gguf with huggingface_hub | 23 days ago |
| Sparse-Llama-3.1-8B-2of4.Q4_0_4_4.gguf | 4.66 GB | yes | Upload Sparse-Llama-3.1-8B-2of4.Q4_0_4_4.gguf with huggingface_hub | 23 days ago |
| Sparse-Llama-3.1-8B-2of4.Q4_0_4_8.gguf | 4.66 GB | yes | Upload Sparse-Llama-3.1-8B-2of4.Q4_0_4_8.gguf with huggingface_hub | 23 days ago |
| Sparse-Llama-3.1-8B-2of4.Q4_1.gguf | 5.13 GB | yes | Upload Sparse-Llama-3.1-8B-2of4.Q4_1.gguf with huggingface_hub | 23 days ago |
| Sparse-Llama-3.1-8B-2of4.Q4_K_M.gguf | 4.92 GB | yes | Upload Sparse-Llama-3.1-8B-2of4.Q4_K_M.gguf with huggingface_hub | 23 days ago |
| Sparse-Llama-3.1-8B-2of4.Q4_K_S.gguf | 4.69 GB | yes | Upload Sparse-Llama-3.1-8B-2of4.Q4_K_S.gguf with huggingface_hub | 23 days ago |
| Sparse-Llama-3.1-8B-2of4.Q5_0.gguf | 5.6 GB | yes | Upload Sparse-Llama-3.1-8B-2of4.Q5_0.gguf with huggingface_hub | 23 days ago |
| Sparse-Llama-3.1-8B-2of4.Q5_1.gguf | 6.07 GB | yes | Upload Sparse-Llama-3.1-8B-2of4.Q5_1.gguf with huggingface_hub | 23 days ago |
| Sparse-Llama-3.1-8B-2of4.Q5_K_M.gguf | 5.73 GB | yes | Upload Sparse-Llama-3.1-8B-2of4.Q5_K_M.gguf with huggingface_hub | 23 days ago |
| Sparse-Llama-3.1-8B-2of4.Q5_K_S.gguf | 5.6 GB | yes | Upload Sparse-Llama-3.1-8B-2of4.Q5_K_S.gguf with huggingface_hub | 23 days ago |
| Sparse-Llama-3.1-8B-2of4.Q6_K.gguf | 6.6 GB | yes | Upload Sparse-Llama-3.1-8B-2of4.Q6_K.gguf with huggingface_hub | 23 days ago |
| Sparse-Llama-3.1-8B-2of4.Q8_0.gguf | 8.54 GB | yes | Upload Sparse-Llama-3.1-8B-2of4.Q8_0.gguf with huggingface_hub | 23 days ago |
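
The listing above offers sixteen quantization variants of the same model, so a reader usually has to pick one by size. The following is a minimal sketch of one way to do that, not a method described by this repository: it copies the file sizes from the listing, picks the largest file that fits a size budget, and (optionally) fetches it with `hf_hub_download` from the `huggingface_hub` library. The helper names `gguf_filename`, `largest_quant_under`, and `download_quant` are hypothetical. File size is only a rough lower bound on memory use; runtime overhead such as the KV cache is not accounted for.

```python
# Sizes in GB, copied from the file listing above (assumed still current).
QUANT_SIZES_GB = {
    "Q2_K": 3.18, "Q3_K_S": 3.66, "Q3_K_M": 4.02, "Q3_K_L": 4.32,
    "Q4_0": 4.66, "Q4_0_4_4": 4.66, "Q4_0_4_8": 4.66,
    "Q4_K_S": 4.69, "Q4_K_M": 4.92, "Q4_1": 5.13,
    "Q5_0": 5.6, "Q5_K_S": 5.6, "Q5_K_M": 5.73, "Q5_1": 6.07,
    "Q6_K": 6.6, "Q8_0": 8.54,
}

REPO_ID = "QuantFactory/Sparse-Llama-3.1-8B-2of4-GGUF"


def gguf_filename(quant: str) -> str:
    """Build the file name for a quantization type listed in the repo."""
    if quant not in QUANT_SIZES_GB:
        raise KeyError(f"unknown quantization type: {quant}")
    return f"Sparse-Llama-3.1-8B-2of4.{quant}.gguf"


def largest_quant_under(budget_gb: float):
    """Return the largest listed quant whose file size fits the budget.

    Ties are resolved in favor of the entry that appears first in
    QUANT_SIZES_GB. Returns None if nothing fits.
    """
    fitting = [q for q, size in QUANT_SIZES_GB.items() if size <= budget_gb]
    if not fitting:
        return None
    # max() with a key returns the first maximal element it encounters.
    return max(fitting, key=lambda q: QUANT_SIZES_GB[q])


def download_quant(quant: str, local_dir=None) -> str:
    """Download one GGUF file and return its local path (needs network)."""
    # Imported lazily so the helpers above work without the library installed.
    from huggingface_hub import hf_hub_download

    return hf_hub_download(
        repo_id=REPO_ID,
        filename=gguf_filename(quant),
        local_dir=local_dir,
    )
```

For example, `largest_quant_under(5.0)` selects `Q4_K_M` (4.92 GB), and `download_quant("Q4_K_M")` would then fetch `Sparse-Llama-3.1-8B-2of4.Q4_K_M.gguf` into the local Hugging Face cache.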