winstxnhdw
/

Meta-Llama-3.1-8B-Instruct-AWQ

Model card Files Files and versions Community

Meta-Llama-3.1-8B-Instruct-AWQ / README.md

winstxnhdw

Update README.md

2874ae2 verified 2 months ago

preview code

raw

history blame contribute delete

344 Bytes

Meta-Llama-3.1-8B-Instruct-AWQ

The following command was used to produce this model.

python quantize.py --model_dir /Meta-Llama-3.1-8B-Instruct \
                   --output_dir /Meta-Llama-3.1-8B-Instruct-AWQ \
                   --dtype bfloat16 \
                   --qformat int4_awq \
                   --awq_block_size 64