File size: 1,236 Bytes
e7417b6
 
 
 
 
75136d5
 
 
8556a5c
e7417b6
 
75136d5
e7417b6
75136d5
e7417b6
1f42b6f
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
---
pipeline_tag: sentence-similarity
tags:
- feature-extraction
- sentence-similarity
license: mit
language:
- fr
- en
---

# Solon Embeddings — base 0.1

SOTA Open source french embedding model.

| Model | Mean Score |
| --- | --- |
| cohere/embed-multilingual-v3 | 0.7402 |
| OrdalieTech/Solon-embeddings-base-0.1 | 0.7306 |
| openai/ada-002 | 0.7290 |
| cohere/embed-multilingual-light-v3 | 0.6945 |
| antoinelouis/biencoder-camembert-base-mmarcoFR | 0.6826 |
| dangvantuan/sentence-camembert-large | 0.6756 |
| voyage/voyage-01 | 0.6753 |
| intfloat/multilingual-e5-large | 0.6660 |
| intfloat/multilingual-e5-base | 0.6597 |
| Sbert/paraphrase-multilingual-mpnet-base-v2 | 0.5975 |
| dangvantuan/sentence-camembert-base | 0.5456 |
| EuropeanParliament/eubert_embedding_v1 | 0.5063 |

These results have been obtained through 9 benchmarks on a variety of text similarity tasks (classification, reranking, STS) :
- AmazonReviewsClassification
- MassiveIntentClassification
- MassiveScenarioClassification
- MTOPDomainClassification
- MTOPIntentClassification
- STS22
- MiraclFRRerank
- OrdalieFRSTS
- OrdalieFRReranking

(evaluation script currently available here : github.com/netapy/mteb)

--------

(Large version comming soon)