michaelfeil
commited on
Commit
•
13661f9
1
Parent(s):
31e97fe
Update README.md
Browse files
README.md
CHANGED
@@ -5627,7 +5627,7 @@ Usage via [infinity, MIT Licensed](https://github.com/michaelfeil/infinity).
|
|
5627 |
docker run \
|
5628 |
--gpus "0" -p "7997":"7997" \
|
5629 |
michaelf34/infinity:0.0.68-trt-onnx \
|
5630 |
-
v2 --model-id Alibaba-NLP/gte-Qwen2-1.5B-instruct --revision "refs/pr/20" --dtype bfloat16 --batch-size 16 --device cuda --engine torch --port 7997
|
5631 |
```
|
5632 |
|
5633 |
## Evaluation
|
|
|
5627 |
docker run \
|
5628 |
--gpus "0" -p "7997":"7997" \
|
5629 |
michaelf34/infinity:0.0.68-trt-onnx \
|
5630 |
+
v2 --model-id Alibaba-NLP/gte-Qwen2-1.5B-instruct --revision "refs/pr/20" --dtype bfloat16 --batch-size 16 --device cuda --engine torch --port 7997 --no-bettertransformer
|
5631 |
```
|
5632 |
|
5633 |
## Evaluation
|