Commit
•
22df070
1
Parent(s):
cdce6f8
Add evaluation results on the adversarialQA config and validation split of adversarial_qa
Browse filesBeep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the adversarialQA config and validation split of the [adversarial_qa](https://huggingface.co/datasets/adversarial_qa) dataset by
@awacke1
, using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-eval-adversarial_qa-adversarialQA-e7f19d-38131101061).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=adversarial_qa).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=adversarial_qa).
README.md
CHANGED
@@ -5,7 +5,31 @@ datasets:
|
|
5 |
- squad
|
6 |
model-index:
|
7 |
- name: test-bert-finetuned-squad
|
8 |
-
results:
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
9 |
---
|
10 |
|
11 |
<!-- This model card has been generated automatically according to the information the Trainer had access to. You
|
|
|
5 |
- squad
|
6 |
model-index:
|
7 |
- name: test-bert-finetuned-squad
|
8 |
+
results:
|
9 |
+
- task:
|
10 |
+
type: question-answering
|
11 |
+
name: Question Answering
|
12 |
+
dataset:
|
13 |
+
name: adversarial_qa
|
14 |
+
type: adversarial_qa
|
15 |
+
config: adversarialQA
|
16 |
+
split: validation
|
17 |
+
metrics:
|
18 |
+
- type: f1
|
19 |
+
value: 31.2065
|
20 |
+
name: F1
|
21 |
+
verified: true
|
22 |
+
verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiOTEwYjQ0ZWFhODdhMWE2ZjJiYTQ5ZWQyMTQyNjhjMGMyODc3YjYyZGRjMTQyZmM3OGQ0MWE4N2QyMDVlMDFhYSIsInZlcnNpb24iOjF9.gcGsKYrAp6Yiy9Khk7VvSqbWAag_Eoy2jevbUaWmdbOIxXipz-C8rMWDiJSByDfX97mIbbyOpmK-A47hJRxlBg
|
23 |
+
- type: exact_match
|
24 |
+
value: 19.9333
|
25 |
+
name: Exact Match
|
26 |
+
verified: true
|
27 |
+
verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiNzI4NzVlYjlhZWUwZDU2YmRmZDZiNjc4ODBiNjg3ZTBmNjZkYTg5NmRkYTdkZDM5NTE2NzEyMDE2ZmRiMTdhMiIsInZlcnNpb24iOjF9.RACgKZB3L0IqRt_UzEQt9r4TozyHCVTsG8MepFp98fvxf-wMrCouesH1TFkS84nnOo7MtHjUwfsfK8zkff_LDg
|
28 |
+
- type: loss
|
29 |
+
value: 3.904836654663086
|
30 |
+
name: loss
|
31 |
+
verified: true
|
32 |
+
verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiMTY1MmNmNTA5NTdkNjczYzRlMzRiMGIxNTNmYjBkZmRiOWI1NjNjMTg5YWU2MGVlODg2ODZkNTVlNTBiNzcxZSIsInZlcnNpb24iOjF9.pbRkqUex2OBeXGBWDKkz-cb5TsPXXnEPgsrhoT7-913AI_5AvZnCROTTcTy6tkQr98ab-md73mBCN7AHT0ntAg
|
33 |
---
|
34 |
|
35 |
<!-- This model card has been generated automatically according to the information the Trainer had access to. You
|