willhe-xverse committed
Commit 0a04e6d · Parent: 4d40a72
Update README.md
README.md CHANGED
@@ -5,7 +5,7 @@ inference: false
 
 ---
 
-# XVERSE-7B-Chat
+# XVERSE-7B-Chat-GPTQ-Int8
 
 ## 模型介绍
 
@@ -67,7 +67,7 @@ for output in outputs:
 
 ## Usage
 
-We demonstrated how to use 'vllm' to run the XVERSE-7B-Chat
+We demonstrated how to use 'vllm' to run the XVERSE-7B-Chat-GPTQ-Int8 quantization model:
 
 ```python
 from vllm import LLM, SamplingParams
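The second hunk shows only the first line of the README's vLLM example (its hunk header indicates a `for output in outputs:` loop further down in the file). For context, below is a minimal, self-contained sketch of running a GPTQ-Int8 quantized chat model with vLLM. The repository id, prompt, and sampling settings are illustrative assumptions and are not taken from the README being diffed.

```python
from vllm import LLM, SamplingParams

# Minimal sketch (not the README's actual snippet): load a GPTQ-quantized
# chat model with vLLM. The repo id, prompt, and sampling values below are
# assumptions for illustration only.
llm = LLM(
    model="xverse/XVERSE-7B-Chat-GPTQ-Int8",  # assumed Hugging Face repo id
    quantization="gptq",                      # weights are GPTQ-quantized
    trust_remote_code=True,                   # XVERSE models ship custom modeling code
)

sampling_params = SamplingParams(temperature=0.8, top_p=0.95, max_tokens=256)

# The exact chat prompt template is not visible in this diff; a plain prompt
# is used here as a placeholder.
prompts = ["Hello, please introduce yourself."]
outputs = llm.generate(prompts, sampling_params)

# Iterate over the generations, mirroring the `for output in outputs:` loop
# referenced by the hunk header above.
for output in outputs:
    print(output.prompt)
    print(output.outputs[0].text)
```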