tomaarsen HF staff commited on
Commit
6ba1338
1 Parent(s): 5e95d41

Add exported openvino model 'openvino_model_qint8_quantized.xml'

Browse files

Hello!

*This pull request has been automatically generated from the [`export_static_quantized_openvino_model`](https://sbert.net/docs/package_reference/util.html#sentence_transformers.backend.export_static_quantized_openvino_model) function from the Sentence Transformers library.*

## Config
```python
OVQuantizationConfig(
quant_method=<OVQuantizationMethod.DEFAULT: 'default'>
)
```

## Tip:
Consider testing this pull request before merging by loading the model from this PR with the `revision` argument:
```python
from sentence_transformers import SentenceTransformer

# TODO: Fill in the PR number
pr_number = 2
model = SentenceTransformer(
"thenlper/gte-base",
revision=f"refs/pr/{pr_number}",
backend="openvino",
model_kwargs={"file_name": "openvino_model_qint8_quantized.xml"},
)

# Verify that everything works as expected
embeddings = model.encode(["The weather is lovely today.", "It's so sunny outside!", "He drove to the stadium."])
print(embeddings.shape)

similarities = model.similarity(embeddings, embeddings)
print(similarities)
```

openvino/openvino_model_qint8_quantized.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ae01003e1707d90f170180dd211dec19e6dd01e8fcb83386cacbda3e3e0604c6
3
+ size 109974480
openvino/openvino_model_qint8_quantized.xml ADDED
The diff for this file is too large to render. See raw diff