Add evaluation results on the adversarialQA config of adversarial_qa
#2
by
autoevaluator
HF staff
- opened
README.md
CHANGED
@@ -11,6 +11,26 @@ datasets:
|
|
11 |
metrics:
|
12 |
- exact_match
|
13 |
- f1
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
14 |
---
|
15 |
|
16 |
# Model Overview
|
@@ -24,4 +44,4 @@ Evaluation data: SQuAD + AdversarialQA
|
|
24 |
Approx. 1 training epoch on the synthetic data and 2 training epochs on the manually-curated data.
|
25 |
|
26 |
# Additional Information
|
27 |
-
Please refer to https://arxiv.org/abs/2104.08678 for full details. You can interact with the model on Dynabench here: https://dynabench.org/models/109
|
|
|
11 |
metrics:
|
12 |
- exact_match
|
13 |
- f1
|
14 |
+
model-index:
|
15 |
+
- name: mbartolo/electra-large-synqa
|
16 |
+
results:
|
17 |
+
- task:
|
18 |
+
type: question-answering
|
19 |
+
name: Question Answering
|
20 |
+
dataset:
|
21 |
+
name: adversarial_qa
|
22 |
+
type: adversarial_qa
|
23 |
+
config: adversarialQA
|
24 |
+
split: validation
|
25 |
+
metrics:
|
26 |
+
- name: Exact Match
|
27 |
+
type: exact_match
|
28 |
+
value: 54.3667
|
29 |
+
verified: true
|
30 |
+
- name: F1
|
31 |
+
type: f1
|
32 |
+
value: 67.3589
|
33 |
+
verified: true
|
34 |
---
|
35 |
|
36 |
# Model Overview
|
|
|
44 |
Approx. 1 training epoch on the synthetic data and 2 training epochs on the manually-curated data.
|
45 |
|
46 |
# Additional Information
|
47 |
+
Please refer to https://arxiv.org/abs/2104.08678 for full details. You can interact with the model on Dynabench here: https://dynabench.org/models/109
|