nm-testing/SparseLlama-3-8B-pruned_50.2of4-FP8
Text Generation
Transformers
Safetensors
llama
sparse
fp8
vllm
text-generation-inference
Inference Endpoints
arxiv:
8 papers
Commit History
Update README.md, 75cbe95 (verified), mgoin committed on Jun 25
Update README.md (#1), 265e297 (verified), mgoin and alexmarques committed on Jun 25
Update README.md, 4eb2935 (verified), mgoin committed on Jun 21
Update README.md, 736010e (verified), mgoin committed on Jun 20
Update README.md, dfa71fc (verified), mgoin committed on Jun 20
Create README.md, 9492007 (verified), mgoin committed on Jun 20
Upload folder using huggingface_hub, 13166f9 (verified), mgoin committed on Jun 20
initial commit, 4854e68 (verified), mgoin committed on Jun 20