Qdrant
/

all_miniLM_L6_v2_with_attentions

Sentence Similarity

feature-extraction

text-embeddings-inference

Inference Endpoints

Model card Files Files and versions Community

Added model card

#1

by Anush008 - opened Jul 10

base: refs/heads/main

←

from: refs/pr/1

Discussion Files changed

Files changed (1) hide show

README.md +37 -3

README.md CHANGED Viewed

@@ -1,3 +1,37 @@
----
-license: apache-2.0
----

+---
+license: apache-2.0
+language:
+- en
+pipeline_tag: sentence-similarity
+---
+ONNX port of [sentence-transformers/all-MiniLM-L6-v2](https://huggingface.co/sentence-transformers/all-MiniLM-L6-v2) adjusted to return attention weights.
+This model is intended to be used for [BM42 searches](https://qdrant.tech/articles/bm42/).
+### Usage
+Here's an example of performing inference using the model with [FastEmbed](https://github.com/qdrant/fastembed).
+```py
+from fastembed import SparseTextEmbedding
+documents = [
+    "You should stay, study and sprint.",
+    "History can only prepare us to be surprised yet again.",
+]
+model = SparseTextEmbedding(model_name="Qdrant/bm42-all-minilm-l6-v2-attentions")
+embeddings = list(model.embed(documents))
+# [
+#     SparseEmbedding(values=array([0.26399775, 0.24662513, 0.47077307]),
+#                     indices=array([1881538586, 150760872, 1932363795])),
+#     SparseEmbedding(values=array(
+#         [0.38320042, 0.25453135, 0.18017513, 0.30432631, 0.1373556]),
+#                     indices=array([
+#                         733618285, 1849833631, 1008800696, 2090661150,
+#                         1117393019
+#                     ]))
+# ]
+```