WGNW/Llama-2-ko-7b-Chat-auto-gptq-4bit
Tags: Text Generation, Transformers, Safetensors, llama, text-generation-inference, Inference Endpoints, 4-bit precision, gptq
1 contributor
History: 8 commits
Latest commit: Update config.json (da34f32) by WGNW, about 1 year ago
File                  Size           Scan  Updated           Last commit
.gitattributes        1.52 kB        Safe  about 1 year ago  initial commit
config.json           910 Bytes      Safe  about 1 year ago  Update config.json
model.safetensors     4.13 GB (LFS)  Safe  about 1 year ago  Rename gptq_model-4bit-128g.safetensors to model.safetensors
quantize_config.json  211 Bytes      Safe  about 1 year ago  AutoGPTQ model for kfkas/Llama-2-ko-7b-Chat: 4bits
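
The repository holds only the quantized weights and two config files, with no tokenizer files in the listing. A minimal loading sketch therefore probably needs to take the tokenizer from the base model named in the quantize_config.json commit message. The sketch below makes several assumptions: that a recent transformers release with GPTQ support (optimum plus a GPTQ backend such as auto-gptq) is installed, that a CUDA GPU is available, and that config.json carries the quantization settings transformers expects. The helper names load_quantized and generate are hypothetical and not part of the repository.

```python
# Repo IDs are taken from the page; everything else is an illustrative assumption.
QUANTIZED_REPO = "WGNW/Llama-2-ko-7b-Chat-auto-gptq-4bit"
BASE_REPO = "kfkas/Llama-2-ko-7b-Chat"  # base model named in the quantize_config.json commit


def load_quantized(quantized_repo: str = QUANTIZED_REPO, tokenizer_repo: str = BASE_REPO):
    """Load the 4-bit GPTQ weights plus a tokenizer (hypothetical helper).

    Assumes GPTQ support is installed and that the tokenizer must come from
    the base model, since no tokenizer files appear in the quantized repo.
    """
    # Imported lazily so that defining this helper does not need the heavy dependencies.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(tokenizer_repo)
    # device_map="auto" places the weights on the available GPU(s).
    model = AutoModelForCausalLM.from_pretrained(quantized_repo, device_map="auto")
    return model, tokenizer


def generate(model, tokenizer, prompt: str, max_new_tokens: int = 64) -> str:
    """Run a single generation and return the decoded text (hypothetical helper)."""
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling load_quantized() downloads roughly 4 GB of weights on first use. If loading through transformers fails because config.json lacks the quantization settings, loading through the AutoGPTQ library directly may be an alternative; that is untested here.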