Create README.md
Browse files
README.md
ADDED
@@ -0,0 +1,22 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
language:
|
3 |
+
- ar
|
4 |
+
- en
|
5 |
+
pipeline_tag: text-generation
|
6 |
+
---
|
7 |
+
<!-- README_GPTQ.md-use-from-python start -->
|
8 |
+
## How to use this GPTQ model from Python code
|
9 |
+
|
10 |
+
### Install the necessary packages
|
11 |
+
|
12 |
+
Requires: Transformers 4.32.0 or later, Optimum 1.12.0 or later, and AutoGPTQ 0.4.2 or later.
|
13 |
+
|
14 |
+
```shell
|
15 |
+
pip3 install transformers>=4.32.0 optimum>=1.12.0
|
16 |
+
pip3 install auto-gptq --extra-index-url https://huggingface.github.io/autogptq-index/whl/cu118/ # Use cu117 if on CUDA 11.7
|
17 |
+
```
|
18 |
+
|
19 |
+
### You can then generate a simple gradioweb with_quant.py
|
20 |
+
'''bash
|
21 |
+
python web_quant.py --model-name ${model-path}
|
22 |
+
'''
|