danielheinz
/

e5-base-sts-en-de

Feature Extraction

text-embeddings-inference

Inference Endpoints

Model card Files Files and versions Community

e5-base-sts-en-de / README.md

danielheinz's picture

Update README.md

f23cbcf verified 7 months ago

|

971 Bytes

	---
	license: mit
	datasets:
	- deutsche-telekom/ger-backtrans-paraphrase
	- paws-x
	- stsb_multi_mt
	language:
	- de
	model-index:
	- name: e5-base-sts-en-de
	results:
	- task:
	type: semantic textual similarity
	dataset:
	type: stsb_multi_mt
	name: stsb_multi_mt
	metrics:
	- type: spearmanr
	value: 0.904
	---
	The model is a [multilingual-e5-base](https://huggingface.co/intfloat/multilingual-e5-base) model fine-tuned with the task of semantic textual similarity in mind.

	## Model Training
	The model has been fine-tuned on the German subsets of the following datasets:
	- [German paraphrase corpus by Philip May](https://huggingface.co/datasets/deutsche-telekom/ger-backtrans-paraphrase)
	- [paws-x](https://huggingface.co/datasets/paws-x)
	- [stsb_multi_mt](https://huggingface.co/datasets/stsb_multi_mt)

	# Results
	The model achieves the following results:
	- 0.920 on stsb's validation subset
	- 0.904 on stsb's test subset