Commit 8dfe84c, committed by Orion-zhen
1 parent: eb34a33
Update README.md
README.md
CHANGED
@@ -11,6 +11,12 @@ tags:
 
 # Qwen2-72B-Instruct
 
+## Quantization
+
+This is a very aggressive 2 bpw (bits per weight) quantization. It uses [pixiv-novel](https://huggingface.co/datasets/Orion-zhen/pixiv-novel) as the calibration dataset, to keep the model's perplexity on novel-style text as low as possible and preserve performance in that domain.
+
+The model can be loaded and run on a single consumer-grade 24 GB GPU, and works best when paired with [SillyTavern](https://github.com/SillyTavern/SillyTavern).
+
 ## Introduction
 
 Qwen2 is the new series of Qwen large language models. For Qwen2, we release a number of base language models and instruction-tuned language models ranging from 0.5 to 72 billion parameters, including a Mixture-of-Experts model. This repo contains the instruction-tuned 72B Qwen2 model.
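
The quantization note in this change says a 2 bpw build of the 72B model fits on a single 24 GB card. A rough back-of-the-envelope check of that claim might look like the sketch below. It assumes a nominal parameter count of 72e9 taken from the model name (the exact count differs slightly), counts only weight storage, and ignores the KV cache, activations, and any layers kept at higher precision. The helper name `weight_size_gib` is hypothetical, not part of any library.

```python
# Rough estimate of weight storage at a given bits-per-weight (bpw).
# Assumption: 72e9 parameters is a nominal figure from the "72B" in the
# model name; the real parameter count is somewhat different.

def weight_size_gib(n_params: float, bpw: float) -> float:
    """Approximate size of the quantized weights in GiB (weights only)."""
    total_bytes = n_params * bpw / 8   # bits -> bytes
    return total_bytes / 2**30         # bytes -> GiB


N_PARAMS = 72e9   # nominal parameter count (assumption)
VRAM_GIB = 24     # card size from the README, treated here as GiB

for bpw in (16, 4, 2):
    size = weight_size_gib(N_PARAMS, bpw)
    verdict = "fits" if size < VRAM_GIB else "does not fit"
    print(f"{bpw:>2} bpw: {size:6.1f} GiB of weights, {verdict} in {VRAM_GIB} GiB")
```

Under these assumptions the 2 bpw weights come to roughly 16.8 GiB, which leaves a few GiB of headroom for the cache and runtime overhead, while a 4 bpw build would already exceed the card on weights alone.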