seonglae/llama-2-13b-chat-hf-gptq

Text Generation

text-generation-inference

1 contributor

History: 5 commits

Latest commit: Update README.md (584121a, 12 months ago)

| File | Size | Last commit | Updated |
| --- | --- | --- | --- |
| .gitattributes | 1.52 kB | initial commit | 12 months ago |
| README.md | 995 Bytes | Update README.md | 12 months ago |
| config.json | 625 Bytes | build: AutoGPTQ for meta-llama/Llama-2-13b-chat-hf: 4bits, gr128, desc_act=False | 12 months ago |
| generation_config.json | 170 Bytes | build: AutoGPTQ for meta-llama/Llama-2-13b-chat-hf: 4bits, gr128, desc_act=False | 12 months ago |
| gptq_model-4bit-128g.safetensors | 7.26 GB (LFS) | build: AutoGPTQ for meta-llama/Llama-2-13b-chat-hf: 4bits, gr128, desc_act=False | 12 months ago |
| quantize_config.json | 225 Bytes | build: AutoGPTQ for meta-llama/Llama-2-13b-chat-hf: 4bits, gr128, desc_act=False | 12 months ago |
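
The commit message records the quantization settings (4 bits, group size 128, `desc_act=False`), and the weight file name appears to follow them. The sketch below is an illustration only, not the author's build script: it restates those settings as a dictionary, derives the weight file name from them, and makes a rough size estimate to compare against the 7.26 GB listed. The dictionary key names, the `gptq_model-{bits}bit-{group_size}g` naming pattern, the Llama-2-13b layer dimensions, and the helper names `default_basename` and `estimate_size_gb` are all assumptions that the listing itself does not state.

```python
# Settings taken from the commit message: "4bits, gr128, desc_act=False".
# The key names are an assumption about what quantize_config.json contains.
QUANTIZE_CONFIG = {"bits": 4, "group_size": 128, "desc_act": False}

# Assumed Llama-2-13b dimensions (not stated anywhere in the listing).
HIDDEN = 5120         # model width
INTERMEDIATE = 13824  # MLP inner width
LAYERS = 40           # number of transformer blocks
VOCAB = 32000         # vocabulary size


def default_basename(cfg):
    """Hypothetical helper: build the weight file stem from the settings."""
    return f"gptq_model-{cfg['bits']}bit-{cfg['group_size']}g"


def estimate_size_gb(cfg):
    """Hypothetical helper: rough on-disk size in decimal gigabytes.

    Assumes every attention and MLP projection is quantized, while the
    token embedding and output head stay in 16-bit floats.
    """
    # Four attention projections plus three MLP projections per block.
    linear_params = LAYERS * (4 * HIDDEN * HIDDEN + 3 * HIDDEN * INTERMEDIATE)
    embed_params = 2 * VOCAB * HIDDEN  # input embedding + output head

    groups = linear_params / cfg["group_size"]
    packed_weights = linear_params * cfg["bits"] / 8  # packed low-bit weights
    scales = groups * 2                               # one fp16 scale per group
    zeros = groups * cfg["bits"] / 8                  # one packed zero point per group
    unquantized = embed_params * 2                    # fp16 embedding and head

    return (packed_weights + scales + zeros + unquantized) / 1e9


if __name__ == "__main__":
    print(default_basename(QUANTIZE_CONFIG) + ".safetensors")
    print(f"estimated size: {estimate_size_gb(QUANTIZE_CONFIG):.2f} GB")
```

Under these assumptions the derived name matches `gptq_model-4bit-128g.safetensors`, and the estimate comes out at roughly 7.25 GB, close to the listed size. The small remainder would plausibly be normalization weights, index tensors, and file metadata, which the estimate ignores.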