nixiesearch
/

e5-base-v2-onnx

Feature Extraction

sentence-transformers

sentence-similarity

Inference Endpoints

Model card Files Files and versions Community

e5-base-v2-onnx / quantize_config.json

shuttie's picture

add quantized model

d5dc0b9 6 months ago

raw history blame contribute delete

No virus

674 Bytes

	{
	"per_channel": true,
	"reduce_range": true,
	"per_model_config": {
	"model": {
	"op_types": [
	"Mul",
	"Sub",
	"Softmax",
	"Sqrt",
	"Erf",
	"Slice",
	"Cast",
	"Div",
	"Gather",
	"Constant",
	"Unsqueeze",
	"Add",
	"Shape",
	"Reshape",
	"Transpose",
	"Pow",
	"ReduceMean",
	"Concat",
	"MatMul"
	],
	"weight_type": "QInt8"
	}
	}
	}