Add evaluation results on the plain_text config of anli
Browse filesBeep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the plain_text config of the [anli](https://huggingface.co/datasets/anli) dataset by
@MoritzLaurer
, using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-staging-eval-anli-plain_text-dfb10f-14405974).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=anli).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=anli).
README.md
CHANGED
@@ -68,6 +68,59 @@ model-index:
|
|
68 |
type: loss
|
69 |
value: 1.8788293600082397
|
70 |
verified: true
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
71 |
---
|
72 |
# DeBERTa-v3-base-mnli-fever-anli
|
73 |
## Model description
|
|
|
68 |
type: loss
|
69 |
value: 1.8788293600082397
|
70 |
verified: true
|
71 |
+
- task:
|
72 |
+
type: natural-language-inference
|
73 |
+
name: Natural Language Inference
|
74 |
+
dataset:
|
75 |
+
name: anli
|
76 |
+
type: anli
|
77 |
+
config: plain_text
|
78 |
+
split: test_r1
|
79 |
+
metrics:
|
80 |
+
- name: Accuracy
|
81 |
+
type: accuracy
|
82 |
+
value: 0.712
|
83 |
+
verified: true
|
84 |
+
- name: Precision Macro
|
85 |
+
type: precision
|
86 |
+
value: 0.7134839439315348
|
87 |
+
verified: true
|
88 |
+
- name: Precision Micro
|
89 |
+
type: precision
|
90 |
+
value: 0.712
|
91 |
+
verified: true
|
92 |
+
- name: Precision Weighted
|
93 |
+
type: precision
|
94 |
+
value: 0.7134676028447461
|
95 |
+
verified: true
|
96 |
+
- name: Recall Macro
|
97 |
+
type: recall
|
98 |
+
value: 0.7119814425203647
|
99 |
+
verified: true
|
100 |
+
- name: Recall Micro
|
101 |
+
type: recall
|
102 |
+
value: 0.712
|
103 |
+
verified: true
|
104 |
+
- name: Recall Weighted
|
105 |
+
type: recall
|
106 |
+
value: 0.712
|
107 |
+
verified: true
|
108 |
+
- name: F1 Macro
|
109 |
+
type: f1
|
110 |
+
value: 0.7119226991285647
|
111 |
+
verified: true
|
112 |
+
- name: F1 Micro
|
113 |
+
type: f1
|
114 |
+
value: 0.712
|
115 |
+
verified: true
|
116 |
+
- name: F1 Weighted
|
117 |
+
type: f1
|
118 |
+
value: 0.7119242267218338
|
119 |
+
verified: true
|
120 |
+
- name: loss
|
121 |
+
type: loss
|
122 |
+
value: 1.0105403661727905
|
123 |
+
verified: true
|
124 |
---
|
125 |
# DeBERTa-v3-base-mnli-fever-anli
|
126 |
## Model description
|