File size: 971 Bytes
b672aea
 
9aaae0e
 
 
 
 
 
57db5d2
 
 
 
 
 
 
f23cbcf
57db5d2
 
 
b672aea
9aaae0e
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
---
license: mit
datasets:
- deutsche-telekom/ger-backtrans-paraphrase
- paws-x
- stsb_multi_mt
language:
- de
model-index:
  - name: e5-base-sts-en-de
    results:
    - task:
        type: semantic textual similarity
      dataset:
        type: stsb_multi_mt
        name: stsb_multi_mt
      metrics:
        - type: spearmanr
          value: 0.904
---
The model is a [multilingual-e5-base](https://huggingface.co/intfloat/multilingual-e5-base) model fine-tuned with the task of semantic textual similarity in mind.

## Model Training
The model has been fine-tuned on the German subsets of the following datasets:
- [German paraphrase corpus by Philip May](https://huggingface.co/datasets/deutsche-telekom/ger-backtrans-paraphrase)
- [paws-x](https://huggingface.co/datasets/paws-x)
- [stsb_multi_mt](https://huggingface.co/datasets/stsb_multi_mt)

# Results
The model achieves the following results:
- 0.920 on stsb's validation subset
- 0.904 on stsb's test subset