sinequa/passage-ranker.pistachio


clarine committed on Jul 17, 2024

Commit 4115910 (1 parent: 44a46d4)

Unified readme format


README.md +12 -10


@@ -7,9 +7,9 @@ language:
   - it
   - ja
   - nl
-  - pl
   - pt
   - zh
 ---
 # Model Card for `passage-ranker.pistachio`
@@ -22,16 +22,16 @@ Model name: `passage-ranker.pistachio`
 The model was trained and tested in the following languages:
-- Chinese (simplified)
-- Dutch
 - English
 - French
 - German
 - Italian
 - Japanese
-- Polish
 - Portuguese
-- Spanish
 Besides the aforementioned languages, basic support can be expected for additional 93 languages that were used during the pretraining of the base model (see
 [list of languages](https://github.com/google-research/bert/blob/master/multilingual.md#list-of-languages)).
@@ -43,7 +43,7 @@ Besides the aforementioned languages, basic support can be expected for addition
 | English Relevance (NDCG@10) | 0.474 |
 | Polish Relevance (NDCG@10)  | 0.380 |
-Note that the relevance score is computed as an average over 14 retrieval datasets (see
 [details below](#evaluation-metrics)).
 ## Inference Times
@@ -131,7 +131,7 @@ the [PIRBenchmark](https://github.com/sdadas/pirb) with BM25 as the first stage
 | arguana-pl    |   0.285 |
 | dbpedia-pl    |   0.283 |
 | fiqa-pl       |   0.223 |
-| hotpoqa-pl    |   0.603 |
 | msmarco-pl    |   0.259 |
 | nfcorpus-pl   |   0.293 |
 | nq-pl         |   0.355 |
@@ -142,12 +142,14 @@ the [PIRBenchmark](https://github.com/sdadas/pirb) with BM25 as the first stage
 #### Other languages
-We evaluated the model on the datasets of the [MIRACL benchmark](https://github.com/project-miracl/miracl) to test its multilingual capacities. Note that not all training languages are part of the benchmark, so we only report the metrics for the existing languages.
 | Language              | NDCG@10 |
 |:----------------------|--------:|
-| Chinese (simplified)  |   0.454 |
 | French                |   0.439 |
 | German                |   0.418 |
 | Japanese              |   0.517 |
-| Spanish               |   0.487 |

   - it
   - ja
   - nl
   - pt
   - zh
+  - pl
 ---
 # Model Card for `passage-ranker.pistachio`
 The model was trained and tested in the following languages:
 - English
 - French
 - German
+- Spanish
 - Italian
+- Dutch
 - Japanese
 - Portuguese
+- Chinese (simplified)
+- Polish
 Besides the aforementioned languages, basic support can be expected for additional 93 languages that were used during the pretraining of the base model (see
 [list of languages](https://github.com/google-research/bert/blob/master/multilingual.md#list-of-languages)).
 | English Relevance (NDCG@10) | 0.474 |
 | Polish Relevance (NDCG@10)  | 0.380 |
+Note that the relevance score is computed as an average over several retrieval datasets (see
 [details below](#evaluation-metrics)).
 ## Inference Times
 | arguana-pl    |   0.285 |
 | dbpedia-pl    |   0.283 |
 | fiqa-pl       |   0.223 |
+| hotpotqa-pl   |   0.603 |
 | msmarco-pl    |   0.259 |
 | nfcorpus-pl   |   0.293 |
 | nq-pl         |   0.355 |
 #### Other languages
+We evaluated the model on the datasets of the [MIRACL benchmark](https://github.com/project-miracl/miracl) to test its
+multilingual capacities. Note that not all training languages are part of the benchmark, so we only report the metrics
+for the existing languages.
 | Language              | NDCG@10 |
 |:----------------------|--------:|
 | French                |   0.439 |
 | German                |   0.418 |
+| Spanish               |   0.487 |
 | Japanese              |   0.517 |
+| Chinese (simplified)  |   0.454 |
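
The tables touched by this commit report NDCG@10, and the card says the headline relevance score is an average over several retrieval datasets. As a reading aid, here is a minimal sketch of how such a figure can be computed. It is not the evaluation code behind these numbers: the function names are hypothetical, and it assumes each query comes with the graded relevance labels of its candidate passages, listed in the order the ranker returned them, with the ideal ordering derived from that same candidate list.

```python
import math


def dcg_at_k(relevances, k=10):
    """Discounted cumulative gain over the first k ranked items."""
    # rank is 0-based, so the discount for the top item is log2(2) = 1.
    return sum(rel / math.log2(rank + 2) for rank, rel in enumerate(relevances[:k]))


def ndcg_at_k(relevances, k=10):
    """NDCG@k for one query.

    Assumption: `relevances` holds the labels of all candidate passages in
    ranked order, so the ideal ranking is the same list sorted descending.
    """
    ideal = dcg_at_k(sorted(relevances, reverse=True), k)
    return dcg_at_k(relevances, k) / ideal if ideal > 0 else 0.0


def mean_ndcg(per_query_relevances, k=10):
    """Average NDCG@k over the queries of one dataset."""
    scores = [ndcg_at_k(rels, k) for rels in per_query_relevances]
    return sum(scores) / len(scores)


if __name__ == "__main__":
    # Toy labels, not taken from any benchmark: two queries, three candidates each.
    toy_queries = [[1, 0, 0], [0, 1, 0]]
    print(f"mean NDCG@10 on toy data: {mean_ndcg(toy_queries):.3f}")
```

A score averaged across datasets would then be the plain mean of the per-dataset values returned by `mean_ndcg`, under the same assumptions.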