LightEmbed/sbert-all-mpnet-base-v2-onnx
This is the ONNX version of the Sentence Transformers model sentence-transformers/all-mpnet-base-v2 for sentence embedding, optimized for speed and lightweight performance. By utilizing onnxruntime and tokenizers instead of heavier libraries like sentence-transformers and transformers, this version ensures a smaller library size and faster execution. Below are the details of the model:
- Base model: sentence-transformers/all-mpnet-base-v2
- Embedding dimension: 768
- Max sequence length: 384
- File size on disk: 0.41 GB
- Pooling incorporated: Yes
This ONNX model consists all components in the original sentence transformer model: Transformer, Pooling, Normalize
Usage (LightEmbed)
Using this model becomes easy when you have LightEmbed installed:
pip install -U light-embed
Then you can use the model using the original model name like this:
from light_embed import TextEmbedding
sentences = [
"This is an example sentence",
"Each sentence is converted"
]
model = TextEmbedding('sentence-transformers/all-mpnet-base-v2')
embeddings = model.encode(sentences)
print(embeddings)
Then you can use the model using onnx model name like this:
from light_embed import TextEmbedding
sentences = [
"This is an example sentence",
"Each sentence is converted"
]
model = TextEmbedding('LightEmbed/sbert-all-mpnet-base-v2-onnx')
embeddings = model.encode(sentences)
print(embeddings)
Citing & Authors
Binh Nguyen / binhcode25@gmail.com
- Downloads last month
- 7
Inference API (serverless) does not yet support light-embed models for this pipeline type.