update readme
README.md (CHANGED)
@@ -49,7 +49,6 @@ For more details about the open-source model of Qwen-7B, please refer to the [Gi
* 建议使用CUDA 11.4及以上(GPU用户、flash-attention用户等需考虑此选项)

-
* python 3.8 and above
* pytorch 1.12 and above, 2.0 and above are recommended
* CUDA 11.4 and above are recommended (this is for GPU users, flash-attention users, etc.)
@@ -58,7 +57,7 @@ For more details about the open-source model of Qwen-7B, please refer to the [Gi
运行Qwen-7B,请确保满足上述要求,再执行以下pip命令安装依赖库

-To run Qwen-7B, please make sure
+To run Qwen-7B, please make sure you meet the above requirements, and then execute the following pip commands to install the dependent libraries.

```bash
pip install transformers==4.31.0 accelerate tiktoken einops
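
As a quick illustration of what these dependencies are for, here is a minimal sketch (not part of this diff) of loading the model once the packages above are installed. The checkpoint ID `Qwen/Qwen-7B-Chat` and the `trust_remote_code=True` flag are assumptions based on how the model is published on the Hugging Face Hub.

```python
# Minimal sketch (not from this diff): load Qwen-7B after installing the
# dependencies above. The checkpoint ID and trust_remote_code flag are
# assumptions about how the model is published on the Hugging Face Hub.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Qwen/Qwen-7B-Chat"  # assumed Hub ID
tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    device_map="auto",       # uses the accelerate package installed above
    trust_remote_code=True,  # Qwen ships custom modeling and tokenizer code
)

inputs = tokenizer("你好!", return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```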
@@ -321,9 +320,9 @@ We introduce NTK-aware interpolation, LogN attention scaling, Window attention,
## 量化(Quantization)

-如希望使用更低精度的量化模型,如4比特和8比特的模型,我们提供了简单的示例来说明如何快速使用量化模型。在开始前,确保你已经安装了`bitsandbytes
+如希望使用更低精度的量化模型,如4比特和8比特的模型,我们提供了简单的示例来说明如何快速使用量化模型。在开始前,确保你已经安装了`bitsandbytes`。请注意,`bitsandbytes`的安装要求是:

-We provide examples to show how to load models in `NF4` and `Int8`. For starters, make sure you have implemented `bitsandbytes`. Note that the requirements for `bitsandbytes`
+We provide examples to show how to load models in `NF4` and `Int8`. For starters, make sure you have implemented `bitsandbytes`. Note that the requirements for `bitsandbytes` are:

```
**Requirements** Python >=3.8. Linux distribution (Ubuntu, MacOS, etc.) + CUDA > 10.0.
@@ -342,7 +341,7 @@ pip install bitsandbytes
Then you only need to add your quantization configuration to `AutoModelForCausalLM.from_pretrained`. See the example below:

```python
-from transformers import BitsAndBytesConfig
+from transformers import AutoModelForCausalLM, BitsAndBytesConfig

# quantization configuration for NF4 (4 bits)
quantization_config = BitsAndBytesConfig(
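
The hunk above ends in the middle of the example, so here is a hedged sketch of how the NF4 configuration and the quantized model load might be completed. The specific `BitsAndBytesConfig` field values and the checkpoint ID are assumptions, not the literal continuation of the README.

```python
# Hedged sketch of how the truncated example might continue; field values and
# the checkpoint ID are assumptions, not the README's exact continuation.
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

# quantization configuration for NF4 (4 bits)
quantization_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)

# for Int8 (8 bits), a config like BitsAndBytesConfig(load_in_8bit=True)
# could be used instead

model = AutoModelForCausalLM.from_pretrained(
    "Qwen/Qwen-7B-Chat",  # assumed Hub ID
    quantization_config=quantization_config,
    device_map="auto",
    trust_remote_code=True,
)
```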