ONNX version of intfloat/e5-large-v2

This is a sentence-transformers model: it maps sentences and paragraphs to a 1024-dimensional dense vector space and can be used for tasks like clustering or semantic search.

The model was converted with the onnx-convert tool using the following parameters:

python convert.sh --model_id intfloat/e5-large-v2 --quantize QInt8 --optimize 2

Two versions of the model are available:

  • model.onnx - Float32 version, with optimize=2
  • model_opt2_QInt8.onnx - QInt8 quantized version, with optimize=2
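
Either file can be loaded with ONNX Runtime. The following is a minimal sketch rather than an official usage recipe. It assumes that the `onnxruntime`, `transformers`, and `numpy` packages are installed, that the tokenizer is taken from the upstream `intfloat/e5-large-v2` repository, and that the first output of the graph holds the per-token hidden states. The helper names (`average_pool`, `l2_normalize`, `embed`) are hypothetical and not part of this repository. Mean pooling over the attention mask followed by L2 normalization mirrors how the upstream E5 model produces sentence embeddings.

```python
import numpy as np


def average_pool(hidden: np.ndarray, mask: np.ndarray) -> np.ndarray:
    """Mean of token vectors per sentence, ignoring padding (mask == 0)."""
    mask = mask[..., None].astype(hidden.dtype)      # (batch, seq, 1)
    summed = (hidden * mask).sum(axis=1)             # (batch, dim)
    counts = np.clip(mask.sum(axis=1), 1e-9, None)   # avoid division by zero
    return summed / counts


def l2_normalize(vectors: np.ndarray) -> np.ndarray:
    """Scale each row to unit length so dot products equal cosine similarity."""
    norms = np.linalg.norm(vectors, axis=1, keepdims=True)
    return vectors / np.clip(norms, 1e-12, None)


def embed(texts, model_path="model.onnx", tokenizer_id="intfloat/e5-large-v2"):
    """Hypothetical helper: encode texts with a local ONNX file.

    Assumes the first graph output is the token-level hidden state
    with shape (batch, seq, dim).
    """
    # Imported here so the pooling helpers work without these packages.
    import onnxruntime as ort
    from transformers import AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(tokenizer_id)
    session = ort.InferenceSession(model_path, providers=["CPUExecutionProvider"])

    enc = tokenizer(texts, padding=True, truncation=True,
                    max_length=512, return_tensors="np")
    # Feed only the inputs the exported graph actually declares.
    wanted = {i.name for i in session.get_inputs()}
    feeds = {k: v.astype(np.int64) for k, v in enc.items() if k in wanted}

    hidden = session.run(None, feeds)[0]
    return l2_normalize(average_pool(hidden, enc["attention_mask"]))
```

The upstream E5 models expect each input to carry a `query: ` or `passage: ` prefix, so a call would look like `embed(["query: how do onnx models work", "passage: ONNX is an open format for ML models."])`. To try the quantized variant, pass `model_path="model_opt2_QInt8.onnx"`.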

License

Apache 2.0
