autoevaluator (HF staff) committed on
Commit 1e52cc5, 1 Parent(s): d7a80b3

Add evaluation results on the default config and test split of multi_news

Beep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the default config and test split of the [multi_news](https://huggingface.co/datasets/multi_news) dataset by @pszemraj, using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-eval-multi_news-default-e22c67-2252871793).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=multi_news).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=multi_news).

Files changed (1)
  1. README.md +33 -0
README.md CHANGED
@@ -342,6 +342,39 @@ model-index:
       type: gen_len
       value: 222.6626
       verified: true
+  - task:
+      type: summarization
+      name: Summarization
+    dataset:
+      name: multi_news
+      type: multi_news
+      config: default
+      split: test
+    metrics:
+    - name: ROUGE-1
+      type: rouge
+      value: 38.7332
+      verified: true
+    - name: ROUGE-2
+      type: rouge
+      value: 11.0072
+      verified: true
+    - name: ROUGE-L
+      type: rouge
+      value: 18.6018
+      verified: true
+    - name: ROUGE-LSUM
+      type: rouge
+      value: 34.5911
+      verified: true
+    - name: loss
+      type: loss
+      value: 3.5744354724884033
+      verified: true
+    - name: gen_len
+      type: gen_len
+      value: 192.0014
+      verified: true
 ---

 # Longformer Encoder-Decoder (LED) for Narrative-Esque Long Text Summarization
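
For readers who want to see the shape of the added entry at a glance, here is a minimal Python sketch that assembles the same task / dataset / metrics structure as a plain dictionary. It is only an illustration: `build_result_entry` is a hypothetical helper, not the evaluator's actual code, and the numbers are copied from the added lines of this commit. Setting `verified` to true simply mirrors the diff and does not reproduce any verification step.

```python
from __future__ import annotations

from typing import Any


def build_result_entry(
    dataset: str,
    config: str,
    split: str,
    metrics: dict[str, tuple[str, float]],
) -> dict[str, Any]:
    """Assemble one results entry in the shape shown in the diff.

    `metrics` maps a display name to a (type, value) pair.
    Hypothetical helper for illustration only.
    """
    return {
        "task": {"type": "summarization", "name": "Summarization"},
        "dataset": {
            "name": dataset,
            "type": dataset,
            "config": config,
            "split": split,
        },
        "metrics": [
            # `verified: True` mirrors the diff; nothing is verified here.
            {"name": name, "type": mtype, "value": value, "verified": True}
            for name, (mtype, value) in metrics.items()
        ],
    }


# Values copied from the added lines of this commit.
entry = build_result_entry(
    "multi_news",
    "default",
    "test",
    {
        "ROUGE-1": ("rouge", 38.7332),
        "ROUGE-2": ("rouge", 11.0072),
        "ROUGE-L": ("rouge", 18.6018),
        "ROUGE-LSUM": ("rouge", 34.5911),
        "loss": ("loss", 3.5744354724884033),
        "gen_len": ("gen_len", 192.0014),
    },
)

for metric in entry["metrics"]:
    print(f'{metric["name"]}: {metric["value"]}')
```

Serializing `entry` to YAML (for example with a YAML library of your choice) would give a block equivalent to the one added under `model-index` in README.md.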