vanewu commited on
Commit
935be87
1 Parent(s): 7991465

fixed readme.

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -16,7 +16,7 @@ The inference speed of lyraChatGLM has achieved **10x** acceleration upon the or
16
  Among its main features are:
17
 
18
  - weights: original ChatGLM-6B weights released by THUDM.
19
- - device: lyraChatGLM is mainly based on FasterTransformer compiled for SM=80 (A100, for example), but a lot faster.
20
  - batch_size: compiled with dynamic batch size, max batch_size = 8
21
 
22
  ## Speed
 
16
  Among its main features are:
17
 
18
  - weights: original ChatGLM-6B weights released by THUDM.
19
+ - device: lyraChatGLM is mainly based on TensorRT compiled for SM=80 (A100, for example).
20
  - batch_size: compiled with dynamic batch size, max batch_size = 8
21
 
22
  ## Speed