autoevaluator HF staff commited on
Commit
b1a87d7
1 Parent(s): 0332187

Add evaluation results on the adversarialQA config and validation split of adversarial_qa

Browse files

Beep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the adversarialQA config and validation split of the [adversarial_qa](https://huggingface.co/datasets/adversarial_qa) dataset by @nomic-ai, using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-eval-adversarial_qa-adversarialQA-1b5bc0-46134145181).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=adversarial_qa).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=adversarial_qa).

Files changed (1) hide show
  1. README.md +27 -0
README.md CHANGED
@@ -23,6 +23,33 @@ model_index:
23
  dataset:
24
  name: conll2003
25
  type: conll2003
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
26
  ---
27
 
28
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 
23
  dataset:
24
  name: conll2003
25
  type: conll2003
26
+ model-index:
27
+ - name: andi611/bert-large-uncased-whole-word-masking-squad2-with-ner-conll2003-with-neg-with-repeat
28
+ results:
29
+ - task:
30
+ type: question-answering
31
+ name: Question Answering
32
+ dataset:
33
+ name: adversarial_qa
34
+ type: adversarial_qa
35
+ config: adversarialQA
36
+ split: validation
37
+ metrics:
38
+ - type: f1
39
+ value: 18.5493
40
+ name: F1
41
+ verified: true
42
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiNDY3NjQ2ZDViNGU5OTA4YWRmNTA0NTNhOWJmNDQ3NzgwY2ZiYmYyYzAxYTE3YTgxNGZhMjBjY2YwODMwZGVhOSIsInZlcnNpb24iOjF9.mmxIAZ6p-fxKtLI49f9CvjB_LjNrgDJ-TU3esLiV5dv9y6HkRgZ5PAsF0VO6gbdfEJWxKWZFX0aBlNo9Vz_tBg
43
+ - type: exact_match
44
+ value: 13.3333
45
+ name: Exact Match
46
+ verified: true
47
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiOWZlZWU3MDMzY2NkYTdkNTY2NTg5OTZiMzczNDVjYThjZmQ4NTlmZjQwM2VhMjVhOTZjZjliMTI4ZWFkMjA3ZSIsInZlcnNpb24iOjF9.HN7DglPZtzdAJ_vwyGleQLKKJJautl8b3jroS3FqUmz3dLQWNS9omjAFuu5i1G7pxom2DhKTXwZKFhIxwzReDA
48
+ - type: loss
49
+ value: 7.114065647125244
50
+ name: loss
51
+ verified: true
52
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiNWZhODY3NTdhNDFiNTYyZGJhZjc0OWJjZGNkNDlkNzc2MTAxMzI2Yzg0NjkxYTRmOTY5YWZhNTI0NzE4MzI2ZiIsInZlcnNpb24iOjF9.wzZOasFqp8mrzS26Ubz8CQq2fnpyXJF0V0c1I6gjaiWcMoMwePNSapXK41cD4tB5orRqKeQFodvp2hW7xjtSCQ
53
  ---
54
 
55
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You