lewtun HF staff commited on
Commit
22e1ae8
1 Parent(s): 730f7a5

Add evaluation results on the mnli config of glue

Browse files

Beep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the mnli config of the [glue](https://huggingface.co/datasets/glue) dataset by

@lewtun

, using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-staging-eval-glue-mnli-026a6e-14686020).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=glue).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=glue).

Files changed (1) hide show
  1. README.md +53 -0
README.md CHANGED
@@ -19,6 +19,59 @@ model-index:
19
  - name: Accuracy
20
  type: accuracy
21
  value: 0.5818644931227712
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
22
  ---
23
 
24
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 
19
  - name: Accuracy
20
  type: accuracy
21
  value: 0.5818644931227712
22
+ - task:
23
+ type: natural-language-inference
24
+ name: Natural Language Inference
25
+ dataset:
26
+ name: glue
27
+ type: glue
28
+ config: mnli
29
+ split: validation_matched
30
+ metrics:
31
+ - name: Accuracy
32
+ type: accuracy
33
+ value: 0.42017320427916455
34
+ verified: true
35
+ - name: Precision Macro
36
+ type: precision
37
+ value: 0.4778318048060659
38
+ verified: true
39
+ - name: Precision Micro
40
+ type: precision
41
+ value: 0.42017320427916455
42
+ verified: true
43
+ - name: Precision Weighted
44
+ type: precision
45
+ value: 0.4806016887604142
46
+ verified: true
47
+ - name: Recall Macro
48
+ type: recall
49
+ value: 0.4256080108498115
50
+ verified: true
51
+ - name: Recall Micro
52
+ type: recall
53
+ value: 0.42017320427916455
54
+ verified: true
55
+ - name: Recall Weighted
56
+ type: recall
57
+ value: 0.42017320427916455
58
+ verified: true
59
+ - name: F1 Macro
60
+ type: f1
61
+ value: 0.4153902114594921
62
+ verified: true
63
+ - name: F1 Micro
64
+ type: f1
65
+ value: 0.42017320427916455
66
+ verified: true
67
+ - name: F1 Weighted
68
+ type: f1
69
+ value: 0.41424823889685775
70
+ verified: true
71
+ - name: loss
72
+ type: loss
73
+ value: 2.7295780181884766
74
+ verified: true
75
  ---
76
 
77
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You