--- license: apache-2.0 language: - zh - en --- # Chinese-Alpaca-2-7B-64K This repository contains GGUF-v3 version (llama.cpp compatible) of **Chinese-Alpaca-2-7B-64K**, which is tuned on Chinese-Alpaca-2-7B with **YaRN method**. ## Performance Metric: PPL, lower is better | Quant | original | imatrix (`-im`) | |-----|------|------| | Q2_K | 9.8201 +/- 0.13298 | 10.3057 +/- 0.14197 | | Q3_K | 8.4435 +/- 0.11467 | 8.3556 +/- 0.11316 | | Q4_0 | 8.3573 +/- 0.11496 | - | | Q4_K | 8.0558 +/- 0.10948 | 8.0557 +/- 0.10964 | | Q5_0 | 8.0220 +/- 0.10954 | - | | Q5_K | 7.9388 +/- 0.10802 | 7.9440 +/- 0.10815 | | Q6_K | 7.9267 +/- 0.10792 | 7.9126 +/- 0.10775 | | Q8_0 | 7.9117 +/- 0.10773 | - | | F16 | 7.9124 +/- 0.10780 | - | *The model with `-im` suffix is generated with important matrix, which has generally better performance (not always though).* ## Others For full model in HuggingFace format, please see: https://huggingface.co/hfl/chinese-alpaca-2-7b-64k Please refer to [https://github.com/ymcui/Chinese-LLaMA-Alpaca-2/](https://github.com/ymcui/Chinese-LLaMA-Alpaca-2/) for more details.