Commit d3a3b70 (1 parent: d4aa690), committed by michaelfeil
Update Readme instructions with infinity
README.md CHANGED
@@ -2909,6 +2909,14 @@ async def main():
 asyncio.run(main())
 ```

+or via Infinity Docker Image
+
+```bash
+docker run --gpus all -v $PWD/data:/app/.cache -p "7999":"7997" \
+michaelf34/infinity:0.0.68 \
+v2 --model-id BAAI/bge-large-en-v1.5 --revision "main" --dtype float16 --batch-size 32 --device cuda --engine torch --port 7997
+```
+
 ### Usage for Reranker

 Different from embedding model, reranker uses question and document as input and directly output similarity instead of embedding.
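The Docker command added in this commit starts an Infinity server on container port 7997 and publishes it on host port 7999. The commit does not show how to call the running server, so the following is only a minimal sketch of a client. It assumes the server exposes an OpenAI-style `/embeddings` route at the root path and returns a `{"data": [{"embedding": [...]}]}` body; the helper names `build_embedding_request` and `embed` are hypothetical and not part of Infinity or the README.

```python
import json
import urllib.request

# Assumption: host port 7999 is the one published by `-p "7999":"7997"` in the diff.
INFINITY_URL = "http://localhost:7999"


def build_embedding_request(texts, model="BAAI/bge-large-en-v1.5", base_url=INFINITY_URL):
    """Build a POST request for an OpenAI-style /embeddings route (the path is an assumption)."""
    payload = json.dumps({"model": model, "input": list(texts)}).encode("utf-8")
    return urllib.request.Request(
        url=f"{base_url}/embeddings",
        data=payload,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def embed(texts, timeout=30.0):
    """Send the request and return one embedding (list of floats) per input text."""
    request = build_embedding_request(texts)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        body = json.load(response)
    # Assumes the OpenAI-style response shape: {"data": [{"embedding": [...]}, ...]}
    return [item["embedding"] for item in body["data"]]
```

With the container running, `embed(["What is a panda?"])` would return a list containing one vector. If the route or response shape differs in the deployed Infinity version, only `build_embedding_request` and the last line of `embed` need to change.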