Add evaluation results on the 3.0.0 config of cnn_dailymail
Browse filesBeep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the 3.0.0 config of the [cnn_dailymail](https://huggingface.co/datasets/cnn_dailymail) dataset by
@pszemraj
, using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-staging-eval-project-cnn_dailymail-b5ccd808-10945470).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=cnn_dailymail).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=cnn_dailymail).
README.md
CHANGED
@@ -275,6 +275,39 @@ model-index:
|
|
275 |
type: gen_len
|
276 |
value: 248.7994
|
277 |
verified: true
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
278 |
---
|
279 |
|
280 |
# long-t5-tglobal-base-16384 + BookSum
|
|
|
275 |
type: gen_len
|
276 |
value: 248.7994
|
277 |
verified: true
|
278 |
+
- task:
|
279 |
+
type: summarization
|
280 |
+
name: Summarization
|
281 |
+
dataset:
|
282 |
+
name: cnn_dailymail
|
283 |
+
type: cnn_dailymail
|
284 |
+
config: 3.0.0
|
285 |
+
split: test
|
286 |
+
metrics:
|
287 |
+
- name: ROUGE-1
|
288 |
+
type: rouge
|
289 |
+
value: 30.7382
|
290 |
+
verified: true
|
291 |
+
- name: ROUGE-2
|
292 |
+
type: rouge
|
293 |
+
value: 7.3107
|
294 |
+
verified: true
|
295 |
+
- name: ROUGE-L
|
296 |
+
type: rouge
|
297 |
+
value: 17.7773
|
298 |
+
verified: true
|
299 |
+
- name: ROUGE-LSUM
|
300 |
+
type: rouge
|
301 |
+
value: 27.4241
|
302 |
+
verified: true
|
303 |
+
- name: loss
|
304 |
+
type: loss
|
305 |
+
value: .nan
|
306 |
+
verified: true
|
307 |
+
- name: gen_len
|
308 |
+
type: gen_len
|
309 |
+
value: 125.228
|
310 |
+
verified: true
|
311 |
---
|
312 |
|
313 |
# long-t5-tglobal-base-16384 + BookSum
|