|
--- |
|
library_name: light-embed |
|
pipeline_tag: sentence-similarity |
|
tags: |
|
- sentence-transformers |
|
- feature-extraction |
|
- sentence-similarity |
|
|
|
--- |
|
|
|
# LightEmbed/baai-bge-base-en-v1.5-onnx |
|
|
|
This is the ONNX version of the Sentence Transformers model BAAI/bge-base-en-v1.5 for sentence embedding, optimized for speed and lightweight performance. By utilizing onnxruntime and tokenizers instead of heavier libraries like sentence-transformers and transformers, this version ensures a smaller library size and faster execution. Below are the details of the model: |
|
- Base model: BAAI/bge-base-en-v1.5 |
|
- Embedding dimension: 768 |
|
- Max sequence length: 512 |
|
- File size on disk: 0.41 GB |
|
- Pooling incorporated: Yes |
|
|
|
This ONNX model consists all components in the original sentence transformer model: |
|
Transformer, Pooling, Normalize |
|
|
|
<!--- Describe your model here --> |
|
|
|
## Usage (LightEmbed) |
|
|
|
Using this model becomes easy when you have [LightEmbed](https://pypi.org/project/light-embed/) installed: |
|
|
|
``` |
|
pip install -U light-embed |
|
``` |
|
|
|
Then you can use the model using the original model name like this: |
|
|
|
```python |
|
from light_embed import TextEmbedding |
|
sentences = [ |
|
"This is an example sentence", |
|
"Each sentence is converted" |
|
] |
|
|
|
model = TextEmbedding('BAAI/bge-base-en-v1.5') |
|
embeddings = model.encode(sentences) |
|
print(embeddings) |
|
``` |
|
|
|
Then you can use the model using onnx model name like this: |
|
|
|
```python |
|
from light_embed import TextEmbedding |
|
sentences = [ |
|
"This is an example sentence", |
|
"Each sentence is converted" |
|
] |
|
|
|
model = TextEmbedding('LightEmbed/baai-bge-base-en-v1.5-onnx') |
|
embeddings = model.encode(sentences) |
|
print(embeddings) |
|
``` |
|
|
|
## Citing & Authors |
|
|
|
Binh Nguyen / binhcode25@gmail.com |