-
sentence-transformers/gooaq
Viewer • Updated • 3.01M • 575 • 10 -
sentence-transformers/yahoo-answers
Viewer • Updated • 3.14M • 337 • 3 -
sentence-transformers/msmarco-msmarco-distilbert-base-tas-b
Viewer • Updated • 86.3M • 1.22k • 4 -
sentence-transformers/msmarco-msmarco-distilbert-base-v3
Viewer • Updated • 88.9M • 701 • 2
Sentence Transformers
university
AI & ML interests
In the following you find models tuned to be used for sentence / text embedding generation. They can be used with the sentence-transformers package.
Organization Card
SentenceTransformers 🤗 is a Python framework for state-of-the-art sentence, text and image embeddings.
Install the Sentence Transformers library.
pip install -U sentence-transformers
The usage is as simple as:
from sentence_transformers import SentenceTransformer
model = SentenceTransformer('paraphrase-MiniLM-L6-v2')
# Sentences we want to encode. Example:
sentence = ['This framework generates embeddings for each input sentence']
# Sentences are encoded by calling model.encode()
embedding = model.encode(sentence)
Hugging Face makes it easy to collaboratively build and showcase your Sentence Transformers models! You can collaborate with your organization, upload and showcase your own models in your profile ❤️
Documentation
Push your Sentence Transformers models to the Hub ❤️
Find all Sentence Transformers models on the 🤗 Hub
To upload your Sentence Transformers models to the Hugging Face Hub, log in with huggingface-cli login
and use the save_to_hub
method within the Sentence Transformers library.
from sentence_transformers import SentenceTransformer
# Load or train a model
model = SentenceTransformer(...)
# Push to Hub
model.push_to_hub("my_new_model")
Collections
3
A curated subset of the datasets that work out of the box with Sentence Transformers: https://huggingface.co/datasets?other=sentence-transformers
These datasets all have "english" and "non_english" columns for numerous datasets. They can be used to make embedding models multilingual.
-
sentence-transformers/parallel-sentences-wikititles
Viewer • Updated • 14.7M • 95 -
sentence-transformers/parallel-sentences-tatoeba
Viewer • Updated • 8.35M • 1.17k -
sentence-transformers/parallel-sentences-talks
Viewer • Updated • 19.6M • 3.26k • 7 -
sentence-transformers/parallel-sentences-europarl
Viewer • Updated • 49.7M • 1.02k
models
124
sentence-transformers/all-MiniLM-L6-v2
Sentence Similarity
•
Updated
•
96.6M
•
•
2.57k
sentence-transformers/all-mpnet-base-v2
Sentence Similarity
•
Updated
•
415M
•
•
894
sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2
Sentence Similarity
•
Updated
•
11.7M
•
692
sentence-transformers/paraphrase-multilingual-mpnet-base-v2
Sentence Similarity
•
Updated
•
2.08M
•
323
sentence-transformers/LaBSE
Sentence Similarity
•
Updated
•
395k
•
230
sentence-transformers/all-MiniLM-L12-v2
Sentence Similarity
•
Updated
•
7.93M
•
203
sentence-transformers/distiluse-base-multilingual-cased-v2
Sentence Similarity
•
Updated
•
776k
•
163
sentence-transformers/multi-qa-mpnet-base-dot-v1
Sentence Similarity
•
Updated
•
1.63M
•
158
sentence-transformers/clip-ViT-B-32-multilingual-v1
Sentence Similarity
•
Updated
•
240k
•
143
sentence-transformers/multi-qa-MiniLM-L6-cos-v1
Sentence Similarity
•
Updated
•
3.96M
•
117
datasets
76
sentence-transformers/parallel-sentences
Preview
•
Updated
•
995
•
13
sentence-transformers/embedding-training-data
Updated
•
639
•
106
sentence-transformers/parallel-sentences-opus-100
Viewer
•
Updated
•
55M
•
4.71k
•
1
sentence-transformers/trivia-qa-triplet
Viewer
•
Updated
•
52.9M
•
867
•
5
sentence-transformers/t2ranking
Viewer
•
Updated
•
5.53M
•
300
sentence-transformers/mr-tydi
Viewer
•
Updated
•
5.01M
•
1.57k
sentence-transformers/miracl
Viewer
•
Updated
•
8.95M
•
1.81k
•
2
sentence-transformers/mldr
Viewer
•
Updated
•
912k
•
1.24k
•
3
sentence-transformers/pubmedqa
Viewer
•
Updated
•
35.4k
•
180
sentence-transformers/lecard-v2
Viewer
•
Updated
•
13k
•
76