Update model card
#2
by boris-f - opened
README.md
CHANGED
@@ -1,30 +1,63 @@
---
language:
- ja
---

# Model Card for answer-finder-v1-

- Japanese

|:-----------------|:---:|
|__Japanese__ | <span style="font-size:200%;color:#ffd21e;">★</span><span style="font-size:200%;color:#ffd21e;">★</span><span style="font-size:200%;color:#ffd21e;">★</span><span style="font-size:200%;color:#ffd21e;">★</span><span style="font-size:200%;color:#ffd21e;">★</span><span style="font-size:200%;color:#ffd21e;">★</span><span style="font-size:200%;color:#ffd21e;">★</span><span style="font-size:200%;color:#ffd21e;">★</span><span style="font-size:200%;color:#ffd21e;">★</span><span style="font-size:200%;color:black;">★</span> |
||

| [JSQuAD](https://github.com/yahoojapan/JGLUE) | [paper](https://aclanthology.org/2022.lrec-1.317.pdf) |
---
language:
- ja
---

# Model Card for answer-finder-v1-L-ja

This model is a question answering model developed by Sinequa. It produces two lists of logit scores corresponding to the start token and end token of an answer.

Model name: `answer-finder-v1-L-ja`
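As a rough illustration of how these start/end logits can be turned into an answer span, here is a minimal sketch using the standard Hugging Face `transformers` question-answering classes; the repository id and the example question/context are assumptions for illustration, not part of the model card.

```python
# Minimal sketch (not the official Sinequa integration): load the checkpoint
# with the standard question-answering classes and decode the highest-scoring
# start/end token positions. The repo id and example texts are assumptions.
import torch
from transformers import AutoModelForQuestionAnswering, AutoTokenizer

model_id = "sinequa/answer-finder-v1-L-ja"  # assumed Hub id
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForQuestionAnswering.from_pretrained(model_id)

question = "日本の首都はどこですか？"
context = "日本の首都は東京です。東京は世界有数の大都市です。"

inputs = tokenizer(question, context, return_tensors="pt")
with torch.no_grad():
    outputs = model(**inputs)

# One list of logits per token for the answer start, one for the answer end.
start_index = int(outputs.start_logits.argmax(dim=-1))
end_index = int(outputs.end_logits.argmax(dim=-1))
answer = tokenizer.decode(inputs["input_ids"][0][start_index : end_index + 1])
print(answer)
```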
## Supported Languages

The model was trained and tested in the following languages:

- Japanese

Besides the aforementioned languages, basic support can be expected for the 104 languages that were used during the pretraining of the base model (see the [original repository](https://github.com/google-research/bert)).
## Scores

| Metric                                                    |  Value |
|:----------------------------------------------------------|-------:|
| F1 Score on JSQuAD with Hugging Face evaluation pipeline  |   92.1 |
| F1 Score on JSQuAD with Haystack evaluation pipeline      |   91.5 |
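For reference, SQuAD-style F1 can be computed with the `evaluate` library as sketched below; this is only a generic illustration with made-up predictions, not the exact Hugging Face or Haystack evaluation pipeline used for the table above.

```python
# Generic SQuAD-style F1 / exact-match computation with the `evaluate` library.
# The prediction and reference below are invented for illustration only.
import evaluate

squad_metric = evaluate.load("squad")
predictions = [{"id": "q1", "prediction_text": "東京"}]
references = [{"id": "q1", "answers": {"text": ["東京"], "answer_start": [6]}}]
print(squad_metric.compute(predictions=predictions, references=references))
# {'exact_match': 100.0, 'f1': 100.0}
```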
## Inference Time

| GPU        | Batch size 1 | Batch size 32 |
|:-----------|-------------:|--------------:|
| NVIDIA A10 |         4 ms |         84 ms |
| NVIDIA T4  |        15 ms |        361 ms |

The inference times only measure the time the model takes to process a single batch; they do not include pre- or post-processing steps such as tokenization.
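A rough sketch of this measurement protocol (forward pass only, one batch of 32, tokenization excluded from the timed region) could look as follows; the repository id, example texts, and warm-up count are assumptions, and exact numbers will vary by hardware and runtime.

```python
# Rough timing sketch: time only the model forward pass for one batch of 32,
# after a few warm-up runs, excluding tokenization. Assumes a CUDA device.
import time
import torch
from transformers import AutoModelForQuestionAnswering, AutoTokenizer

model_id = "sinequa/answer-finder-v1-L-ja"  # assumed Hub id
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForQuestionAnswering.from_pretrained(model_id).to("cuda").eval()

batch = tokenizer(
    ["日本の首都はどこですか？"] * 32,
    ["日本の首都は東京です。"] * 32,
    padding=True,
    return_tensors="pt",
).to("cuda")

with torch.no_grad():
    for _ in range(10):  # warm-up iterations, not measured
        model(**batch)
    torch.cuda.synchronize()
    start = time.perf_counter()
    model(**batch)  # the measured forward pass
    torch.cuda.synchronize()
    print(f"{(time.perf_counter() - start) * 1000:.1f} ms per batch")
```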
**Note that the Answer Finder models are only used at query time.**

## Requirements

- Minimal Sinequa version: 11.10.0
- GPU memory usage: TODO

Note that the GPU memory usage figure only covers how much GPU memory the actual model consumes on an NVIDIA T4 GPU with a batch size of 32. It does not include the fixed amount of memory that the ONNX Runtime consumes upon initialization, which can be around 0.5 to 1 GiB depending on the GPU used.
## Model Details

### Overview

- Number of parameters: 110 million
- Base language model: [bert-base-multilingual-cased](https://huggingface.co/bert-base-multilingual-cased)
- Sensitive to casing and accents

### Training Data

- [JSQuAD](https://github.com/yahoojapan/JGLUE), see the [paper](https://aclanthology.org/2022.lrec-1.317.pdf)
- Japanese translation of SQuAD v2 "impossible" query-passage pairs