Orion-zhen
/

Qwen2-72B-Instruct-2.0bpw-h-novel-exl2

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Orion-zhen commited on Jun 12

Commit

8dfe84c

•

1 Parent(s): eb34a33

Update README.md

Files changed (1) hide show

README.md +6 -0

README.md CHANGED Viewed

@@ -11,6 +11,12 @@ tags:
 # Qwen2-72B-Instruct
 ## Introduction
 Qwen2 is the new series of Qwen large language models. For Qwen2, we release a number of base language models and instruction-tuned language models ranging from 0.5 to 72 billion parameters, including a Mixture-of-Experts model. This repo contains the instruction-tuned 72B Qwen2 model.

 # Qwen2-72B-Instruct
+## 量化
+非常激进的2bpw量化, 采用[pixiv-novel](https://huggingface.co/datasets/Orion-zhen/pixiv-novel)作为校准数据集, 尽量减少模型在生成小说内容方面的困惑度, 保持对应领域的性能.
+本模型可以在一块消费级的24G显卡上加载运行, 配合[SillyTavern](https://github.com/SillyTavern/SillyTavern)食用更佳
 ## Introduction
 Qwen2 is the new series of Qwen large language models. For Qwen2, we release a number of base language models and instruction-tuned language models ranging from 0.5 to 72 billion parameters, including a Mixture-of-Experts model. This repo contains the instruction-tuned 72B Qwen2 model.