Add evaluation results on the samsum config of samsum
Browse filesBeep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the samsum config of the [samsum](https://huggingface.co/datasets/samsum) dataset by
@lewtun
, using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-staging-eval-project-samsum-ede55545-13415852).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=samsum).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=samsum).
README.md
CHANGED
@@ -144,6 +144,39 @@ model-index:
|
|
144 |
type: gen_len
|
145 |
value: 183.3702
|
146 |
verified: true
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
147 |
---
|
148 |
|
149 |
# BigBirdPegasus model (large)
|
|
|
144 |
type: gen_len
|
145 |
value: 183.3702
|
146 |
verified: true
|
147 |
+
- task:
|
148 |
+
type: summarization
|
149 |
+
name: Summarization
|
150 |
+
dataset:
|
151 |
+
name: samsum
|
152 |
+
type: samsum
|
153 |
+
config: samsum
|
154 |
+
split: test
|
155 |
+
metrics:
|
156 |
+
- name: ROUGE-1
|
157 |
+
type: rouge
|
158 |
+
value: 3.621
|
159 |
+
verified: true
|
160 |
+
- name: ROUGE-2
|
161 |
+
type: rouge
|
162 |
+
value: 0.1699
|
163 |
+
verified: true
|
164 |
+
- name: ROUGE-L
|
165 |
+
type: rouge
|
166 |
+
value: 3.2016
|
167 |
+
verified: true
|
168 |
+
- name: ROUGE-LSUM
|
169 |
+
type: rouge
|
170 |
+
value: 3.3269
|
171 |
+
verified: true
|
172 |
+
- name: loss
|
173 |
+
type: loss
|
174 |
+
value: 7.664482116699219
|
175 |
+
verified: true
|
176 |
+
- name: gen_len
|
177 |
+
type: gen_len
|
178 |
+
value: 233.8107
|
179 |
+
verified: true
|
180 |
---
|
181 |
|
182 |
# BigBirdPegasus model (large)
|