ONNX version of intfloat/e5-large-v2
This is a sentence-transformers model: It maps sentences & paragraphs to a N dimensional dense vector space and can be used for tasks like clustering or semantic search.
The model conversion was made with onnx-convert tool with the following parameters:
python convert.sh --model_id intfloat/e5-large-v2 --quantize QInt8 --optimize 2
There are two versions of model available:
model.onnx
- Float32 version, with optimize=2model_opt2_QInt8.onnx
- QInt8 quantized version, with optimize=2
License
Apache 2.0
- Downloads last month
- 84
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social
visibility and check back later, or deploy to Inference Endpoints (dedicated)
instead.