Feature Extraction
sentence-transformers
ONNX
English
bert
sentence-similarity
Inference Endpoints
text-embeddings-inference
shuttie commited on
Commit
813e1b5
1 Parent(s): 302d3d3

add qint/quint8 opt/deopt models

Browse files
model_quantized.onnx → model_QInt8_noopt.onnx RENAMED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:c7516faf8a93c8be356e65b81a183bc5a4a782e921a6849bcd5fd86d40a03776
3
- size 336983601
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:6b8d549c909eaa095de37679f2a1968be69261b715ae54e073f082f661cde983
3
+ size 336926090
model_QInt8_opt.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5aab9ee2445265c6f279bec3d6f15185938ebfc9b766142eea87add46ec1b6a2
3
+ size 336334633
model_QUInt8_noopt.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c0950c1e7c4d6e383ad982a62e2260c05ee705851be09aef79c15f6f7d682be4
3
+ size 336926089
model_QUInt8_opt.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:63340c79d60363ed9980e9b02a8f678cca36c0be0eddb4f7f9e1f0bf63ea4fa6
3
+ size 336334630