autoevaluator HF staff committed on
Commit 2d5e01a
1 Parent(s): b7c31bc

Add evaluation results on the default config and test split of multi_news

Beep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the default config and test split of the [multi_news](https://huggingface.co/datasets/multi_news) dataset by @pszemraj, using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-eval-multi_news-default-e22c67-2252871794).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=multi_news).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=multi_news).

Files changed (1)
  1. README.md +34 -1
README.md CHANGED
@@ -9,7 +9,7 @@ tags:
  - booksum
  - long-document
  - long-form
- license:
+ license:
  - apache-2.0
  - bsd-3-clause
  datasets:
@@ -278,6 +278,39 @@ model-index:
        type: gen_len
        value: 163.9394
        verified: true
+   - task:
+       type: summarization
+       name: Summarization
+     dataset:
+       name: multi_news
+       type: multi_news
+       config: default
+       split: test
+     metrics:
+     - name: ROUGE-1
+       type: rouge
+       value: 39.0834
+       verified: true
+     - name: ROUGE-2
+       type: rouge
+       value: 11.4043
+       verified: true
+     - name: ROUGE-L
+       type: rouge
+       value: 19.1813
+       verified: true
+     - name: ROUGE-LSUM
+       type: rouge
+       value: 35.1581
+       verified: true
+     - name: loss
+       type: loss
+       value: 4.654905319213867
+       verified: true
+     - name: gen_len
+       type: gen_len
+       value: 186.2494
+       verified: true
  ---

  # Longformer Encoder-Decoder (LED) for Narrative-Esque Long Text Summarization
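
The result entry added in the second hunk can also be inspected programmatically. The following is a minimal Python sketch, assuming the entry has already been parsed into a plain dict (for example by a YAML parser). The `entry` literal and the helper names `verified_metrics` and `summarize` are hypothetical and exist only for illustration; the nesting is assumed from the Hub's model-index layout, and only the values are taken from the diff.

```python
# Hypothetical in-memory copy of the result entry added by this commit.
# Assumption: the nesting mirrors the Hub's model-index layout; values come from the diff.
entry = {
    "task": {"type": "summarization", "name": "Summarization"},
    "dataset": {
        "name": "multi_news",
        "type": "multi_news",
        "config": "default",
        "split": "test",
    },
    "metrics": [
        {"name": "ROUGE-1", "type": "rouge", "value": 39.0834, "verified": True},
        {"name": "ROUGE-2", "type": "rouge", "value": 11.4043, "verified": True},
        {"name": "ROUGE-L", "type": "rouge", "value": 19.1813, "verified": True},
        {"name": "ROUGE-LSUM", "type": "rouge", "value": 35.1581, "verified": True},
        {"name": "loss", "type": "loss", "value": 4.654905319213867, "verified": True},
        {"name": "gen_len", "type": "gen_len", "value": 186.2494, "verified": True},
    ],
}


def verified_metrics(result):
    """Return {metric name: value} for every metric flagged as verified."""
    return {m["name"]: m["value"] for m in result["metrics"] if m.get("verified")}


def summarize(result):
    """Build a short text summary: a dataset header, then one line per verified metric."""
    ds = result["dataset"]
    header = f'{ds["name"]} ({ds["config"]}/{ds["split"]})'
    lines = [f"{name}: {value:.4f}" for name, value in verified_metrics(result).items()]
    return "\n".join([header] + lines)


print(summarize(entry))
```

Filtering on the `verified` flag keeps only the values written by the evaluator; self-reported metrics in the same model card would be skipped under this assumption.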