TariqJamil/Llama-2-13b-chat-q4bit
Tags: Text Generation, Transformers, llama, text-generation-inference, Inference Endpoints
1 contributor
History: 2 commits
Latest commit: 0a638d8 by TariqJamil, about 1 year ago
AutoGPTQ model for meta-llama/Llama-2-13b-chat-hf: 4bits, gr128, desc_act=False
File                              Size           Last commit                                                                       Updated
.gitattributes                    1.52 kB        initial commit                                                                    about 1 year ago
config.json                       640 Bytes      AutoGPTQ model for meta-llama/Llama-2-13b-chat-hf: 4bits, gr128, desc_act=False   about 1 year ago
gptq_model-4bit-128g.safetensors  7.26 GB (LFS)  AutoGPTQ model for meta-llama/Llama-2-13b-chat-hf: 4bits, gr128, desc_act=False   about 1 year ago
quantize_config.json              249 Bytes      AutoGPTQ model for meta-llama/Llama-2-13b-chat-hf: 4bits, gr128, desc_act=False   about 1 year ago
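
The commit message packs the quantization settings into a short string (4bits, gr128, desc_act=False). As a rough illustration, the sketch below parses that string into a dictionary. The function name `parse_gptq_summary` is hypothetical, and the key names `bits`, `group_size`, and `desc_act` are an assumption about how such settings are usually labelled. The actual contents of quantize_config.json are not shown in this listing and may differ.

```python
import json
import re

# Commit message copied verbatim from the repository history above.
COMMIT_MESSAGE = (
    "AutoGPTQ model for meta-llama/Llama-2-13b-chat-hf: "
    "4bits, gr128, desc_act=False"
)


def parse_gptq_summary(message: str) -> dict:
    """Extract the base model and quantization settings from a commit message.

    Hypothetical helper for illustration only; the key names are assumed,
    not read from the repository's quantize_config.json.
    """
    match = re.search(
        r"AutoGPTQ model for (?P<base>[^\s:]+): "
        r"(?P<bits>\d+)bits, gr(?P<group>\d+), desc_act=(?P<desc>True|False)",
        message,
    )
    if match is None:
        raise ValueError("message does not match the expected summary format")
    return {
        "base_model": match["base"],
        "bits": int(match["bits"]),          # weight precision in bits
        "group_size": int(match["group"]),   # "gr128" read as group size 128
        "desc_act": match["desc"] == "True",
    }


settings = parse_gptq_summary(COMMIT_MESSAGE)
print(json.dumps(settings, indent=2))
```

Reading "gr128" as a group size of 128 is consistent with the "128g" suffix in the weights filename gptq_model-4bit-128g.safetensors, but it remains an interpretation of the commit message rather than a value taken from the config file.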