This folder contains pre-computed AWQ (Activation-aware Weight Quantization) search results for the Chinese-LLaMA-2 and Chinese-Alpaca-2 models, which are used to generate AWQ-quantized models.
WARNING: These search results cannot be used on their own; they MUST be used together with the corresponding original model weights.
For usage instructions, see:
- The official AWQ GitHub repository: https://github.com/mit-han-lab/llm-awq
- The llama.cpp awq-py directory: https://github.com/ggerganov/llama.cpp/tree/master/awq-py
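As a rough illustration of how a pre-computed search result is consumed, the sketch below wraps llm-awq's command-line entry point in a small Python helper. The flag names (`--load_awq`, `--q_backend`, `--dump_quant`, etc.) follow the examples in the llm-awq README and may differ between versions, and all model and file paths are placeholders; the repositories linked above remain the authoritative reference. For llama.cpp, the awq-py README describes a similar flow in which the search result is supplied to the conversion script.

```python
# Minimal sketch (not the official workflow): apply a pre-computed AWQ search
# result to the original weights via llm-awq's CLI entry point.
# Flag names are taken from the llm-awq README; verify them against the
# version you have installed. All paths below are placeholders.
import subprocess


def quantize_with_awq(model_path: str, awq_results: str, output_path: str) -> None:
    """Run llm-awq's entry script to produce a real-quantized checkpoint."""
    subprocess.run(
        [
            "python", "-m", "awq.entry",
            "--model_path", model_path,    # original (full-precision) HF weights
            "--w_bit", "4",                # 4-bit weight quantization
            "--q_group_size", "128",       # group size used during the AWQ search
            "--load_awq", awq_results,     # pre-computed search result from this folder
            "--q_backend", "real",         # write real-quantized weights
            "--dump_quant", output_path,   # where to save the quantized checkpoint
        ],
        check=True,
    )


if __name__ == "__main__":
    # Placeholder paths for illustration only.
    quantize_with_awq(
        "path/to/chinese-alpaca-2-7b",
        "awq_cache/chinese-alpaca-2-7b-w4-g128.pt",
        "quant_cache/chinese-alpaca-2-7b-w4-g128-awq.pt",
    )
```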