deepset/xlm-roberta-base-squad2-distilled

MichelBartelsDeepset committed on Jan 6, 2022

Commit 9fdd591 • 1 Parent(s): 26cd7db

Update README.md

Files changed (1)

README.md +6 -16

@@ -1,7 +1,7 @@
 ---
 language: Multilingual
 datasets:
-- deepset/germanquad
 license: mit
 thumbnail: https://thumb.tildacdn.com/tild3433-3637-4830-a533-353833613061/-/resize/720x/-/format/webp/germanquad.jpg
 tags:
@@ -11,20 +11,14 @@ tags:
 ![bert_image](https://thumb.tildacdn.com/tild3433-3637-4830-a533-353833613061/-/resize/720x/-/format/webp/germanquad.jpg)
 ## Overview
-**Language model:** deepset/xlm-roberta-base-squad2-distilled
-**Language:** German
-**Training data:** GermanQuAD train set (~ 12MB)
-**Eval data:** GermanQuAD test set (~ 5MB)
 **Infrastructure**: 1x V100 GPU
 **Published**: Apr 21st, 2021
 ## Details
-- We trained a German question answering model with a gelectra-base model as its basis.
-- The dataset is GermanQuAD, a new, German language dataset, which we hand-annotated and published [online](https://deepset.ai/germanquad).
-- The training dataset is one-way annotated and contains 11518 questions and 11518 answers, while the test dataset is three-way annotated so that there are 2204 questions and with 2204·3−76 = 6536answers, because we removed 76 wrong answers.
-- In addition to the annotations in GermanQuAD, haystack's distillation feature was used for training. deepset/xlm-roberta-large-squad2 was used as the teacher model.
-See https://deepset.ai/germanquad for more details and dataset download in SQuAD format.
 ## Hyperparameters
 ```
@@ -38,11 +32,7 @@ temperature = 3
 distillation_loss_weight = 0.75
 ```
 ## Performance
-We evaluated the extractive question answering performance on the SQuAD v2 dev set.
-Model types and training data are included in the model name.
-For finetuning XLM-Roberta, we use the English SQuAD v2.0 dataset.
-The GELECTRA models are warm started on the German translation of SQuAD v1.1 and finetuned on \\\\germanquad.
-The human baseline was computed for the 3-way test set by taking one answer as prediction and the other two as ground truth.
 ```
 "exact": 79.8366040596311%
 "f1": 83.916407079888%

 ---
 language: Multilingual
 datasets:
+- squad_v2
 license: mit
 thumbnail: https://thumb.tildacdn.com/tild3433-3637-4830-a533-353833613061/-/resize/720x/-/format/webp/germanquad.jpg
 tags:
 ![bert_image](https://thumb.tildacdn.com/tild3433-3637-4830-a533-353833613061/-/resize/720x/-/format/webp/germanquad.jpg)
 ## Overview
+**Language model:** deepset/roberta-base-squad2-distilled
+**Language:** Multilingual
+**Training data:** SQuAD 2.0 training set
 **Infrastructure**: 1x V100 GPU
 **Published**: Apr 21st, 2021
 ## Details
+- haystack's distillation feature was used for training. deepset/xlm-roberta-large-squad2 was used as the teacher model.
 ## Hyperparameters
 ```
 distillation_loss_weight = 0.75
 ```
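
The `temperature` and `distillation_loss_weight` hyperparameters above are usually combined as in the sketch below. It is a minimal, dependency-free illustration of a common knowledge-distillation loss: KL divergence between temperature-softened teacher and student distributions, scaled by the temperature squared and blended with the hard-label cross-entropy. It is an assumption that haystack's distillation feature follows this formulation; the function names here are illustrative and are not haystack's API.

```python
import math


def softmax(logits, temperature=1.0):
    """Softmax over a list of logits, softened by `temperature`."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def kl_divergence(p, q):
    """KL(p || q) for two discrete distributions of equal length."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


def cross_entropy(logits, target_index):
    """Hard-label cross-entropy of the student against the gold index."""
    return -math.log(softmax(logits)[target_index])


def distillation_loss(student_logits, teacher_logits, target_index,
                      temperature=3.0, distillation_loss_weight=0.75):
    """Blend the soft (teacher) loss with the hard (label) loss.

    The defaults mirror the hyperparameters listed in the model card;
    the formula itself is an assumed, textbook-style formulation.
    """
    soft_teacher = softmax(teacher_logits, temperature)
    soft_student = softmax(student_logits, temperature)
    # Scaling by temperature**2 keeps the soft-loss gradients comparable
    # in magnitude to the hard-loss gradients.
    soft_loss = kl_divergence(soft_teacher, soft_student) * temperature ** 2
    hard_loss = cross_entropy(student_logits, target_index)
    return (distillation_loss_weight * soft_loss
            + (1.0 - distillation_loss_weight) * hard_loss)


if __name__ == "__main__":
    # Hypothetical logits over four candidate answer-start positions.
    student = [1.0, 2.5, 0.3, -1.0]
    teacher = [0.5, 3.0, 0.1, -2.0]
    print(distillation_loss(student, teacher, target_index=1))
```

With a weight of 0.75, three quarters of the training signal comes from matching the teacher's softened predictions and one quarter from the annotated answers. A higher temperature flattens both distributions so the student also learns from the teacher's relative ranking of wrong answers.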
 ## Performance
+SQuAD v2 dev set:
 ```
 "exact": 79.8366040596311%
 "f1": 83.916407079888%