lewtun HF staff commited on
Commit
3b19808
1 Parent(s): 9ff7843

Add evaluation results on the qqp config of glue

Browse files

Beep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the qqp config of the [glue](https://huggingface.co/datasets/glue) dataset by

@lewtun

, using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-staging-eval-glue-qqp-c973af-14676011).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=glue).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=glue).

Files changed (1) hide show
  1. README.md +33 -0
README.md CHANGED
@@ -27,6 +27,39 @@ model-index:
27
  - name: F1
28
  type: f1
29
  value: 0.8767633750332712
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
30
  ---
31
 
32
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 
27
  - name: F1
28
  type: f1
29
  value: 0.8767633750332712
30
+ - task:
31
+ type: natural-language-inference
32
+ name: Natural Language Inference
33
+ dataset:
34
+ name: glue
35
+ type: glue
36
+ config: qqp
37
+ split: validation
38
+ metrics:
39
+ - name: Accuracy
40
+ type: accuracy
41
+ value: 0.9083106604006925
42
+ verified: true
43
+ - name: Precision
44
+ type: precision
45
+ value: 0.8684245220830586
46
+ verified: true
47
+ - name: Recall
48
+ type: recall
49
+ value: 0.885052065838092
50
+ verified: true
51
+ - name: AUC
52
+ type: auc
53
+ value: 0.966124561811495
54
+ verified: true
55
+ - name: F1
56
+ type: f1
57
+ value: 0.876659457660955
58
+ verified: true
59
+ - name: loss
60
+ type: loss
61
+ value: 0.3751780092716217
62
+ verified: true
63
  ---
64
 
65
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You