lewtun HF staff commited on
Commit
92bc33b
1 Parent(s): 34bc863

Add evaluation results on the plain_text config of anli

Browse files

Beep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the plain_text config of the [anli](https://huggingface.co/datasets/anli) dataset by

@MoritzLaurer

, using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-staging-eval-anli-plain_text-1f482c-14395973).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=anli).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=anli).

Files changed (1) hide show
  1. README.md +53 -0
README.md CHANGED
@@ -68,6 +68,59 @@ model-index:
68
  type: loss
69
  value: 1.8788293600082397
70
  verified: true
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
71
  ---
72
  # DeBERTa-v3-base-mnli-fever-anli
73
  ## Model description
68
  type: loss
69
  value: 1.8788293600082397
70
  verified: true
71
+ - task:
72
+ type: natural-language-inference
73
+ name: Natural Language Inference
74
+ dataset:
75
+ name: anli
76
+ type: anli
77
+ config: plain_text
78
+ split: test_r2
79
+ metrics:
80
+ - name: Accuracy
81
+ type: accuracy
82
+ value: 0.547
83
+ verified: true
84
+ - name: Precision Macro
85
+ type: precision
86
+ value: 0.5472132584534576
87
+ verified: true
88
+ - name: Precision Micro
89
+ type: precision
90
+ value: 0.547
91
+ verified: true
92
+ - name: Precision Weighted
93
+ type: precision
94
+ value: 0.5472045067334657
95
+ verified: true
96
+ - name: Recall Macro
97
+ type: recall
98
+ value: 0.5469811128493762
99
+ verified: true
100
+ - name: Recall Micro
101
+ type: recall
102
+ value: 0.547
103
+ verified: true
104
+ - name: Recall Weighted
105
+ type: recall
106
+ value: 0.547
107
+ verified: true
108
+ - name: F1 Macro
109
+ type: f1
110
+ value: 0.5465246991169268
111
+ verified: true
112
+ - name: F1 Micro
113
+ type: f1
114
+ value: 0.547
115
+ verified: true
116
+ - name: F1 Weighted
117
+ type: f1
118
+ value: 0.5465299992353281
119
+ verified: true
120
+ - name: loss
121
+ type: loss
122
+ value: 1.6385536193847656
123
+ verified: true
124
  ---
125
  # DeBERTa-v3-base-mnli-fever-anli
126
  ## Model description