HRuiii/Meta-Llama-3-8B-Instruct-GPTQ-4bit

Text Generation

Inference Endpoints

4-bit precision

1 contributor

History: 2 commits

AutoGPTQ model for meta-llama/Meta-Llama-3-8B-Instruct: 4bits, gr128, desc_act=True

502546c verified 5 months ago

File                               Size            Last commit                                                                            Updated
.gitattributes                     1.52 kB         initial commit                                                                         5 months ago
config.json                        1.04 kB         AutoGPTQ model for meta-llama/Meta-Llama-3-8B-Instruct: 4bits, gr128, desc_act=True    5 months ago
gptq_model-4bit-128g.safetensors   5.74 GB (LFS)   AutoGPTQ model for meta-llama/Meta-Llama-3-8B-Instruct: 4bits, gr128, desc_act=True    5 months ago
quantize_config.json               264 Bytes       AutoGPTQ model for meta-llama/Meta-Llama-3-8B-Instruct: 4bits, gr128, desc_act=True    5 months ago
special_tokens_map.json            296 Bytes       AutoGPTQ model for meta-llama/Meta-Llama-3-8B-Instruct: 4bits, gr128, desc_act=True    5 months ago
tokenizer.json                     9.09 MB         AutoGPTQ model for meta-llama/Meta-Llama-3-8B-Instruct: 4bits, gr128, desc_act=True    5 months ago
tokenizer_config.json              51 kB           AutoGPTQ model for meta-llama/Meta-Llama-3-8B-Instruct: 4bits, gr128, desc_act=True    5 months ago
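
The commit message names three quantization settings: 4 bits, "gr128" (read here as group size 128, which matches the "128g" in the weights filename), and desc_act=True. The sketch below records those settings as a Python dict and works out two consequences of them. It is an illustration, not the contents of quantize_config.json: the key names `bits`, `group_size`, and `desc_act` follow AutoGPTQ's quantize config, and any further keys in the real 264-byte file are not visible in this listing and are left out. Packing into 32-bit integers is assumed, as is usual for GPTQ checkpoints.

```python
import json

# Settings copied from the commit message "4bits, gr128, desc_act=True".
# Assumption: "gr128" means group_size=128. The real quantize_config.json
# may contain additional keys that this listing does not show.
quantize_config = {
    "bits": 4,          # each quantized weight is stored in 4 bits
    "group_size": 128,  # weights share one scale/zero-point per 128 values
    "desc_act": True,   # process columns in order of decreasing activation size
}

# Assumption: quantized values are packed into 32-bit integers.
values_per_int32 = 32 // quantize_config["bits"]  # 8 weights per int32

# Bytes taken by the packed weight values of one group (scales and
# zero-points are stored separately and are not counted here).
packed_bytes_per_group = (
    quantize_config["group_size"] * quantize_config["bits"] // 8
)  # 64 bytes

# Round-trip through JSON, the format the repository uses for its config files.
serialized = json.dumps(quantize_config, indent=2)
restored = json.loads(serialized)

print(serialized)
print("values per int32:", values_per_int32)
print("packed bytes per group:", packed_bytes_per_group)
```

The 5.74 GB weights file is larger than 4 bits per parameter alone would suggest; a likely reason is that some tensors, such as the embeddings, are kept at higher precision, but the listing itself does not confirm this.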