QuantFactory/Sparse-Llama-3.1-8B-2of4-GGUF
Text Generation, GGUF, vllm, sparsity, Inference Endpoints
arxiv: 2301.00774
arxiv: 2310.06927
License: llama3.1
Sparse-Llama-3.1-8B-2of4-GGUF/README.md (revision 6f2fa5b)
Commit History
Upload README.md with huggingface_hub (6f2fa5b, verified)
aashish1904 committed on Nov 27