metadata
license: openrail
model_creator: axiong
model_name: PMC_LLaMA_13B
PMC_LLaMA_13B - AWQ
- Model creator: axiong
- Original model: PMC_LLaMA_13B
Description
This repo contains AWQ model files for PMC_LLaMA_13B.
About AWQ
AWQ is an efficient, accurate and blazing-fast low-bit weight quantization method, currently supporting 4-bit quantization. Compared to GPTQ, it offers faster Transformers-based inference with equivalent or better quality compared to the most commonly used GPTQ settings.