Update README.md
Browse files
README.md
CHANGED
@@ -2607,7 +2607,7 @@ duplicated_from: michaelfeil/ct2fast-e5-small-v2
|
|
2607 |
# # Hugging Face Inference Endpoints -compatible version of michaelfeil/ct2fast-e5-small-v2
|
2608 |
Duplicate of michaelfeil/ct2fast-e5-small-v2, modified to run on Hugging Face Inference Endpoints.
|
2609 |
|
2610 |
-
Creates symbolic links so that ctranslate2 reads the repository model without downloading from HF.
|
2611 |
|
2612 |
# # Fast-Inference with Ctranslate2
|
2613 |
Speedup inference while reducing memory by 2x-4x using int8 inference in C++ on CPU or GPU.
|
|
|
2607 |
# # Hugging Face Inference Endpoints -compatible version of michaelfeil/ct2fast-e5-small-v2
|
2608 |
Duplicate of michaelfeil/ct2fast-e5-small-v2, modified to run on Hugging Face Inference Endpoints.
|
2609 |
|
2610 |
+
Requires a GPU Instance type to run. Creates symbolic links so that ctranslate2 reads the repository model without downloading from HF.
|
2611 |
|
2612 |
# # Fast-Inference with Ctranslate2
|
2613 |
Speedup inference while reducing memory by 2x-4x using int8 inference in C++ on CPU or GPU.
|