CATIE-AQ
/

QAmembert

@@ -39,25 +39,51 @@ This represents a total of over **138 061 questions/answers pairs used to finet
 | [PIAFv1.2](https://www.data.gouv.fr/en/datasets/piaf-le-dataset-francophone-de-questions-reponses/)| SQuAD v1    | 9 225 Q & A  | X  | X  |
 | [FQuADv1.0](https://fquad.illuin.tech/)| SQuAD v1    | 20 731 Q & A | 3 188 Q & A  (not used in training because it serves as a test dataset) | 2 189 Q & A (not used in our work because not freely available)|
 | [lincoln/newsquadfr](https://huggingface.co/datasets/lincoln/newsquadfr) | SQuAD v1    | 1 650 Q & A  | 455 Q & A (not used in our work) | 415 Q & A (not used in our work) |
-| [pragnakalp/squad_v2_french_translated](https://huggingface.co/datasets/pragnakalp/squad_v2_french_translated)| SQuAD v2    | 79 069 Q & A  | X  | X  |
-| [Mfa]()♪   | SQuAD v2    | 27 386 Q & A  | X  | X  |
-♪ this fifth data set will be added soon.
 ## Evaluation results
-### FQuAD v1.0 Evaluation
-```shell
-{"f1": 80.75789384679857, "exact_match": 57.214554579673774}
-```
-### Benchmark
 | Model       | Exact_match | F1-score    |
 | ----------- | ----------- | ----------- |
-| [etalab-ia/camembert-base-squadFR-fquad-piaf](https://huggingface.co/etalab-ia/camembert-base-squadFR-fquad-piaf) | 55.14       | 79.81       |
-| QAmembert   | **57.21**       | **80.76**       |
 ## Usage
 ### Example with answer in the context

 | [PIAFv1.2](https://www.data.gouv.fr/en/datasets/piaf-le-dataset-francophone-de-questions-reponses/)| SQuAD v1    | 9 225 Q & A  | X  | X  |
 | [FQuADv1.0](https://fquad.illuin.tech/)| SQuAD v1    | 20 731 Q & A | 3 188 Q & A  (not used in training because it serves as a test dataset) | 2 189 Q & A (not used in our work because not freely available)|
 | [lincoln/newsquadfr](https://huggingface.co/datasets/lincoln/newsquadfr) | SQuAD v1    | 1 650 Q & A  | 455 Q & A (not used in our work) | 415 Q & A (not used in our work) |
+| [pragnakalp/squad_v2_french_translated](https://huggingface.co/datasets/pragnakalp/squad_v2_french_translated)| SQuAD v2    | 79 069 Q & A  | X  | X  |
 ## Evaluation results
+The evaluation was carried out using the [**evaluate**](https://pypi.org/project/evaluate/) python package.
+### FQuaD 1.0 (validation)
+The metric used is Squad v1.
+| Model       | Exact_match | F1-score    |
+| ----------- | ----------- | ----------- |
+| [etalab-ia/camembert-base-squadFR-fquad-piaf](https://huggingface.co/etalab-ia/camembert-base-squadFR-fquad-piaf) | 53.60       | 78.09       |
+| QAmembert (previous version)   | 54.26       | 77.87       |
+| QAmembert (this version)   | 53.98       | 78.00       |
+| QAmembert-large ♪  | **55.95**       | **81.05**       |
+| [fT0](https://huggingface.co/CATIE-AQ/frenchT0)  | 41.15       | 65.79       |
+♪ this model is available on demand only.
+### qwant/squad_fr (validation)
+The metric used is Squad v1.
 | Model       | Exact_match | F1-score    |
 | ----------- | ----------- | ----------- |
+| [etalab-ia/camembert-base-squadFR-fquad-piaf](https://huggingface.co/etalab-ia/camembert-base-squadFR-fquad-piaf) | 60.17       | 78.27       |
+| QAmembert (previous version)   | 60.40       | 77.27       |
+| QAmembert (this version)   |  60.95       | 77.30       |
+| QAmembert-large ♪  | **65.58**       | **81.74**       |
+♪ this model is available on demand only.
+### frenchQA
+This dataset includes question with no answers in the context. The metric used is Squad v2.
+| Model       | Exact_match | F1-score    | Answer_f1 | NoAnswer_f1 |
+| ----------- | ----------- | ----------- | ----------- | ----------- |
+| [etalab-ia/camembert-base-squadFR-fquad-piaf](https://huggingface.co/etalab-ia/camembert-base-squadFR-fquad-piaf) | n/a       | n/a       | n/a       | n/a       |
+| QAmembert (previous version)   | 60.28       | 71.29       | 75.92 | 66.65
+| QAmembert (this version)   |  **77.14**       | 86.88       | 75.66 | 98.11
+| QAmembert-large ♪  | **77.14**       | **88.74**       | **78.83** | **98.65**
+♪ this model is available on demand only.
 ## Usage
 ### Example with answer in the context

config.json CHANGED Viewed

@@ -21,7 +21,7 @@
   "pad_token_id": 1,
   "position_embedding_type": "absolute",
   "torch_dtype": "float32",
-  "transformers_version": "4.24.0",
   "type_vocab_size": 1,
   "use_cache": true,
   "vocab_size": 32005

   "pad_token_id": 1,
   "position_embedding_type": "absolute",
   "torch_dtype": "float32",
+  "transformers_version": "4.26.1",
   "type_vocab_size": 1,
   "use_cache": true,
   "vocab_size": 32005

pytorch_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:6204c15c7cef356c5a2b4b4c254c71adf9564022176bf383168968f0b09e8115
-size 440202673

 version https://git-lfs.github.com/spec/v1
+oid sha256:36796fd3145baf67e83b7878ce5793998e26115a4dac47d9a5a8fee831a214d7
+size 440204333

tokenizer.json CHANGED Viewed

@@ -1,21 +1,7 @@
 {
   "version": "1.0",
-  "truncation": {
-    "direction": "Right",
-    "max_length": 512,
-    "strategy": "OnlySecond",
-    "stride": 128
-  },
-  "padding": {
-    "strategy": {
-      "Fixed": 512
-    },
-    "direction": "Right",
-    "pad_to_multiple_of": null,
-    "pad_id": 1,
-    "pad_type_id": 0,
-    "pad_token": "<pad>"
-  },
   "added_tokens": [
     {
       "id": 0,

 {
   "version": "1.0",
+  "truncation": null,
+  "padding": null,
   "added_tokens": [
     {
       "id": 0,