Update README.md
Browse files
README.md
CHANGED
@@ -2604,6 +2604,11 @@ language:
|
|
2604 |
license: mit
|
2605 |
duplicated_from: michaelfeil/ct2fast-e5-small-v2
|
2606 |
---
|
|
|
|
|
|
|
|
|
|
|
2607 |
# # Fast-Inference with Ctranslate2
|
2608 |
Speedup inference while reducing memory by 2x-4x using int8 inference in C++ on CPU or GPU.
|
2609 |
|
|
|
2604 |
license: mit
|
2605 |
duplicated_from: michaelfeil/ct2fast-e5-small-v2
|
2606 |
---
|
2607 |
+
# # Hugging Face Inference Endpoints -compatible version of michaelfeil/ct2fast-e5-small-v2
|
2608 |
+
Duplicate of michaelfeil/ct2fast-e5-small-v2, modified to run on Hugging Face Inference Endpoints.
|
2609 |
+
|
2610 |
+
Creates symbolic links so that ctranslate2 reads the repository model without downloading from HF.
|
2611 |
+
|
2612 |
# # Fast-Inference with Ctranslate2
|
2613 |
Speedup inference while reducing memory by 2x-4x using int8 inference in C++ on CPU or GPU.
|
2614 |
|