Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
neuralmagic
/
Llama2-7b-chat-pruned50-quant-ds
like
9
Follow
Neural Magic
158
Text Generation
Transformers
ONNX
llama
deepsparse
arxiv:
2301.00774
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
Llama2-7b-chat-pruned50-quant-ds
1 contributor
History:
11 commits
mwitiderrick
Update README.md
e54ea37
10 months ago
.gitattributes
Safe
1.57 kB
Upload folder using huggingface_hub
11 months ago
README.md
Safe
4.06 kB
Update README.md
10 months ago
config.json
Safe
692 Bytes
Upload folder using huggingface_hub
11 months ago
model-orig.onnx
Safe
1.21 MB
LFS
Upload folder using huggingface_hub
11 months ago
model.data
Safe
7.42 GB
LFS
Upload folder using huggingface_hub
11 months ago
model.onnx
Safe
1.19 MB
LFS
Upload folder using huggingface_hub
11 months ago
recipe.yaml
Safe
1.15 kB
Create recipe.yaml
11 months ago
special_tokens_map.json
Safe
435 Bytes
Upload folder using huggingface_hub
11 months ago
tokenizer.json
Safe
1.84 MB
Upload folder using huggingface_hub
11 months ago
tokenizer.model
Safe
500 kB
LFS
Upload folder using huggingface_hub
11 months ago
tokenizer_config.json
Safe
1.01 kB
Upload folder using huggingface_hub
11 months ago