michaelfeil
/

ct2fast-paraphrase-multilingual-MiniLM-L12-v2

@@ -16,7 +16,7 @@ Speedup inference while reducing memory by 2x-4x using int8 inference in C++ on
 quantized version of [sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2](https://huggingface.co/sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2)
 ```bash
-pip install hf-hub-ctranslate2>=2.12.0 ctranslate2>=3.16.0
 ```
 ```python
@@ -56,16 +56,20 @@ embeddings = model.encode(
 print(embeddings.shape, embeddings)
 scores = (embeddings @ embeddings.T) * 100
 ```
-Checkpoint compatible to [ctranslate2>=3.16.0](https://github.com/OpenNMT/CTranslate2)
 and [hf-hub-ctranslate2>=2.12.0](https://github.com/michaelfeil/hf-hub-ctranslate2)
 - `compute_type=int8_float16` for `device="cuda"`
 - `compute_type=int8`  for `device="cpu"`
-Converted on 2023-06-19 using
 ```
-ct2-transformers-converter --model sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2 --output_dir ~/tmp-ct2fast-paraphrase-multilingual-MiniLM-L12-v2 --force --copy_files unigram.json config_sentence_transformers.json tokenizer.json modules.json README.md tokenizer_config.json sentence_bert_config.json special_tokens_map.json .gitattributes --trust_remote_code
 ```
 # Licence and other remarks:

 quantized version of [sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2](https://huggingface.co/sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2)
 ```bash
+pip install hf-hub-ctranslate2>=2.12.0 ctranslate2>=3.17.1
 ```
 ```python
 print(embeddings.shape, embeddings)
 scores = (embeddings @ embeddings.T) * 100
+# Hint: you can also host this code via REST API and
+# via github.com/michaelfeil/infinity
 ```
+Checkpoint compatible to [ctranslate2>=3.17.1](https://github.com/OpenNMT/CTranslate2)
 and [hf-hub-ctranslate2>=2.12.0](https://github.com/michaelfeil/hf-hub-ctranslate2)
 - `compute_type=int8_float16` for `device="cuda"`
 - `compute_type=int8`  for `device="cpu"`
+Converted on 2023-10-13 using
 ```
+LLama-2 -> removed <pad> token.
 ```
 # Licence and other remarks:

model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:03c883e3ae820352efa7873c2fbd04313278cda250f95ecec5804165ed92dbde
-size 470623404

 version https://git-lfs.github.com/spec/v1
+oid sha256:6c59f14a4df4ce59c1b82b6438ba6d3fbf5cb744ec3c9d49efab61cec2ea9425
+size 235315884

vocabulary.txt ADDED Viewed

The diff for this file is too large to render. See raw diff