autoevaluator HF staff commited on
Commit
22df070
1 Parent(s): cdce6f8

Add evaluation results on the adversarialQA config and validation split of adversarial_qa

Browse files

Beep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the adversarialQA config and validation split of the [adversarial_qa](https://huggingface.co/datasets/adversarial_qa) dataset by

@awacke1

, using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-eval-adversarial_qa-adversarialQA-e7f19d-38131101061).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=adversarial_qa).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=adversarial_qa).

Files changed (1) hide show
  1. README.md +25 -1
README.md CHANGED
@@ -5,7 +5,31 @@ datasets:
5
  - squad
6
  model-index:
7
  - name: test-bert-finetuned-squad
8
- results: []
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
9
  ---
10
 
11
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 
5
  - squad
6
  model-index:
7
  - name: test-bert-finetuned-squad
8
+ results:
9
+ - task:
10
+ type: question-answering
11
+ name: Question Answering
12
+ dataset:
13
+ name: adversarial_qa
14
+ type: adversarial_qa
15
+ config: adversarialQA
16
+ split: validation
17
+ metrics:
18
+ - type: f1
19
+ value: 31.2065
20
+ name: F1
21
+ verified: true
22
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiOTEwYjQ0ZWFhODdhMWE2ZjJiYTQ5ZWQyMTQyNjhjMGMyODc3YjYyZGRjMTQyZmM3OGQ0MWE4N2QyMDVlMDFhYSIsInZlcnNpb24iOjF9.gcGsKYrAp6Yiy9Khk7VvSqbWAag_Eoy2jevbUaWmdbOIxXipz-C8rMWDiJSByDfX97mIbbyOpmK-A47hJRxlBg
23
+ - type: exact_match
24
+ value: 19.9333
25
+ name: Exact Match
26
+ verified: true
27
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiNzI4NzVlYjlhZWUwZDU2YmRmZDZiNjc4ODBiNjg3ZTBmNjZkYTg5NmRkYTdkZDM5NTE2NzEyMDE2ZmRiMTdhMiIsInZlcnNpb24iOjF9.RACgKZB3L0IqRt_UzEQt9r4TozyHCVTsG8MepFp98fvxf-wMrCouesH1TFkS84nnOo7MtHjUwfsfK8zkff_LDg
28
+ - type: loss
29
+ value: 3.904836654663086
30
+ name: loss
31
+ verified: true
32
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiMTY1MmNmNTA5NTdkNjczYzRlMzRiMGIxNTNmYjBkZmRiOWI1NjNjMTg5YWU2MGVlODg2ODZkNTVlNTBiNzcxZSIsInZlcnNpb24iOjF9.pbRkqUex2OBeXGBWDKkz-cb5TsPXXnEPgsrhoT7-913AI_5AvZnCROTTcTy6tkQr98ab-md73mBCN7AHT0ntAg
33
  ---
34
 
35
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You