autoevaluator HF staff committed on
Commit fd277e2
1 Parent(s): 168ef09

Add evaluation results on the default config and train split of boolq

Beep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the default config and train split of the [boolq](https://huggingface.co/datasets/boolq) dataset by @mabuyun, using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-eval-boolq-default-cb11e4-46279145185).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=boolq).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=boolq).
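
As a quick sanity check on the metrics this pull request adds to `README.md`, the reported F1 should equal the harmonic mean of the reported precision and recall. The sketch below is illustrative only and is not part of the evaluator: the three values are copied from the diff, while the helper `f1_score` and the tolerance are assumptions made for this example.

```python
import math

# Values copied from the metrics added to README.md in this pull request.
precision = 0.8591506263366941
recall = 0.9574395641811372
reported_f1 = 0.9056360708534621


def f1_score(p: float, r: float) -> float:
    """Harmonic mean of precision and recall (hypothetical helper)."""
    if p + r == 0:
        return 0.0  # avoid division by zero when both inputs are zero
    return 2 * p * r / (p + r)


computed_f1 = f1_score(precision, recall)

# The tolerance is an arbitrary choice for this sketch.
consistent = math.isclose(computed_f1, reported_f1, rel_tol=1e-6)

print(f"computed F1 = {computed_f1:.6f}")
print(f"reported F1 = {reported_f1:.6f}")
print(f"consistent  = {consistent}")
```

The check only confirms that the three reported numbers agree with each other. It says nothing about how the predictions were produced or how the model would score on a different split.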

Files changed (1)
  1. README.md +42 -0
README.md CHANGED
@@ -22,6 +22,48 @@ model_index:
       name: Accuracy
       type: accuracy
       value: 0.7314984709480122
+model-index:
+- name: andi611/distilbert-base-uncased-qa-boolq
+  results:
+  - task:
+      type: natural-language-inference
+      name: Natural Language Inference
+    dataset:
+      name: boolq
+      type: boolq
+      config: default
+      split: train
+    metrics:
+    - type: accuracy
+      value: 0.875676249071815
+      name: Accuracy
+      verified: true
+      verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiNDdjNjIwZDRlZDkzZDZmM2JmYzA0ZjIwMjBlZTI3OWQ5ZWNiNWU0OWI2ZWZmMGI2OGZmMDVhYzhjOTE1M2UzNSIsInZlcnNpb24iOjF9.A4-llThkLZ5SdVf6KTc7kWnJlpPna5b7hhzR7DdbFozIvqlFSeXqUhYf9lxn2svdvfiCJSsP3kHzcn46lYybAg
+    - type: precision
+      value: 0.8591506263366941
+      name: Precision
+      verified: true
+      verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiZmRhNDNhYjE3YTY4Mjk2ZThlMzQ5MGZiNGIxNmM4NDBlNzdlODkxYjRmNWM4YzAwZTlkOTFhZmJkMTQzZTYyZiIsInZlcnNpb24iOjF9.wl_bDHN2z0BXD5_IlLY8eQHFeCRkUGSj3NMOchIcbphiqoVoC_eWZNQqpZhM0XgCdoQrRKw4MNjCiwDq3euYCQ
+    - type: recall
+      value: 0.9574395641811372
+      name: Recall
+      verified: true
+      verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiZGM0ZWJhYjI4YWIwNGQxZGQ3MTA4M2JiOTE5ZDc1ZDk5YjI5N2VjYjQzMTM3ZjM4YjVlNjNhNmU0MTVjZGJkNCIsInZlcnNpb24iOjF9.oC3_3F4164-tAIb0huR5xdzzRLpbxyJ52waXaWjbES8h0YRCrIjzmzgbhx4PPulxm8J59X1RF1wFsVXFFco3Bg
+    - type: auc
+      value: 0.9423158636459945
+      name: AUC
+      verified: true
+      verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiYmE5YjAxYjVmMjQwMzM0ZjBmNmM2NjFjZTcxMzIzNTk3NTdlNzVlOTM3YTMxMTdlNWMzNmE3YTk5MDQ1Y2VhYSIsInZlcnNpb24iOjF9.96hf0lrJ59bzlDm8lX9fv4WqNTP0mFVtpILWz-L3yBZyb4TIIKUh-JgDRwlLPu-JZlZS-gJSeAxPobrhJY0iCg
+    - type: f1
+      value: 0.9056360708534621
+      name: F1
+      verified: true
+      verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiMzk0N2EzMWUyZGI4NmE3NjlkZDI3ZjkwMDIwZTdhNzAwZjBjNmYxYjYzYjJkMjFlOWRiNWUxMTFiZmM5ZmJhNyIsInZlcnNpb24iOjF9.W5wBUPEtxI2Movs6_UKrxA5sNNgV7m619TLWfwG5uSA0bgcE9xmH9EnNljsbSnFn2ObxTmrUK-W0OZ3SzL9hCg
+    - type: loss
+      value: 0.45028823614120483
+      name: loss
+      verified: true
+      verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiZDdkOGYyMTJlYmRlYTRmNGI3YzkyOTRhOGMwZGY2N2MzYjVhZjgwN2U2YjdjMGQzMmYyZjFkMTFlM2Y0NmQ0ZCIsInZlcnNpb24iOjF9.PJWhSy48ZNYnp76dTuvhuvj-EFFWd8hzN5He1nIlHOqiPHglCtnSon161R7Ar4ILWy4LyPM8ByRslhzJfj-WDw
 ---
 
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You