Update README.md
README.md
CHANGED
@@ -39,7 +39,7 @@ LMDeploy supports the following NVIDIA GPU for W4A16 inference:
 Before proceeding with the quantization and inference, please ensure that lmdeploy is installed.
 
 ```shell
-pip install lmdeploy>=0.
+pip install lmdeploy>=0.6.4
 ```
 
 This article comprises the following sections:
@@ -74,7 +74,7 @@ For more information about the pipeline parameters, please refer to [here](https
 LMDeploy's `api_server` enables models to be easily packed into services with a single command. The provided RESTful APIs are compatible with OpenAI's interfaces. Below is an example of service startup:
 
 ```shell
-lmdeploy serve api_server OpenGVLab/InternVL-Chat-V1-5-AWQ --
+lmdeploy serve api_server OpenGVLab/InternVL-Chat-V1-5-AWQ --server-port 23333 --model-format awq
 ```
 
 To use the OpenAI-style interface, you need to install OpenAI:
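For reference, below is a minimal client sketch against the server started above. It is illustrative and not part of the original README: it assumes the server is reachable locally on the port set by `--server-port` above, that the OpenAI-compatible routes are mounted under `/v1`, and that no API key is enforced (a placeholder key is passed).

```python
from openai import OpenAI

# Assumption: the api_server launched above listens on localhost:23333 and
# exposes OpenAI-compatible routes under /v1; the API key is a placeholder
# since none was configured at server startup.
client = OpenAI(api_key="YOUR_API_KEY", base_url="http://0.0.0.0:23333/v1")

# Ask the server which model it is serving instead of hard-coding the name.
model_name = client.models.list().data[0].id

response = client.chat.completions.create(
    model=model_name,
    messages=[{"role": "user", "content": "Hello! Which model are you?"}],
    temperature=0.8,
)
print(response.choices[0].message.content)
```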