neuralmagic/zephyr-7b-beta-marlin
Neural Magic
Tags: Text Generation, Transformers, Safetensors, mistral, nm-vllm, marlin, int4, conversational, text-generation-inference, Inference Endpoints, 4-bit precision, gptq, arxiv:2210.17323
Branch: main
1 contributor · History: 8 commits
Latest commit: robertgshaw2, "Update README.md" (99a5e49, verified), 10 months ago
| Name | Scan | Size | LFS | Last commit | Updated |
|---|---|---|---|---|---|
| quantization | | | | Update quantization/apply_gptq_save_marlin.py | 10 months ago |
| .gitattributes | Safe | 1.52 kB | | initial commit | 11 months ago |
| README.md | Safe | 2.25 kB | | Update README.md | 10 months ago |
| config.json | Safe | 996 Bytes | | added models | 11 months ago |
| model.safetensors | Safe | 4.12 GB | LFS | added models | 11 months ago |
| quantize_config.json | Safe | 287 Bytes | | added models | 11 months ago |
| special_tokens_map.json | Safe | 624 Bytes | | added models | 11 months ago |
| tokenizer.json | Safe | 1.8 MB | | added models | 11 months ago |
| tokenizer.model | Safe | 493 kB | LFS | added models | 11 months ago |
| tokenizer_config.json | Safe | 1.48 kB | | added models | 11 months ago |