autoevaluator HF staff committed on
Commit
f0a613f
1 Parent(s): 6401238

Add evaluation results on the default config and test split of multi_news

Beep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the default config and test split of the [multi_news](https://huggingface.co/datasets/multi_news) dataset by @pszemraj, using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-eval-multi_news-default-e22c67-2252871792).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=multi_news).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=multi_news).

Files changed (1)
  1. README.md +36 -1
README.md CHANGED
@@ -15,7 +15,42 @@ datasets:
 metrics:
 - rouge
 inference: false
-
+model-index:
+- name: pszemraj/long-t5-tglobal-xl-16384-book-summary
+  results:
+  - task:
+      type: summarization
+      name: Summarization
+    dataset:
+      name: multi_news
+      type: multi_news
+      config: default
+      split: test
+    metrics:
+    - name: ROUGE-1
+      type: rouge
+      value: 36.2043
+      verified: true
+    - name: ROUGE-2
+      type: rouge
+      value: 8.424
+      verified: true
+    - name: ROUGE-L
+      type: rouge
+      value: 17.3721
+      verified: true
+    - name: ROUGE-LSUM
+      type: rouge
+      value: 32.3994
+      verified: true
+    - name: loss
+      type: loss
+      value: 2.0843334197998047
+      verified: true
+    - name: gen_len
+      type: gen_len
+      value: 248.3572
+      verified: true
 ---
 
 # long-t5-tglobal-xl + BookSum
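
For readers who want to work with these numbers programmatically, the sketch below shows one way the `model-index` entry added by this commit could be flattened into a name-to-value mapping. It is an illustration only: the `MODEL_INDEX` literal is transcribed by hand from the diff rather than parsed from `README.md`, and `collect_metrics` is a hypothetical helper, not part of any Hugging Face library. In practice the front matter would first be loaded with a YAML parser.

```python
from typing import Any

# Hand-transcribed from the model-index block added in this commit
# (assumption: the nesting follows the usual model-index layout).
MODEL_INDEX: list[dict[str, Any]] = [
    {
        "name": "pszemraj/long-t5-tglobal-xl-16384-book-summary",
        "results": [
            {
                "task": {"type": "summarization", "name": "Summarization"},
                "dataset": {
                    "name": "multi_news",
                    "type": "multi_news",
                    "config": "default",
                    "split": "test",
                },
                "metrics": [
                    {"name": "ROUGE-1", "type": "rouge", "value": 36.2043, "verified": True},
                    {"name": "ROUGE-2", "type": "rouge", "value": 8.424, "verified": True},
                    {"name": "ROUGE-L", "type": "rouge", "value": 17.3721, "verified": True},
                    {"name": "ROUGE-LSUM", "type": "rouge", "value": 32.3994, "verified": True},
                    {"name": "loss", "type": "loss", "value": 2.0843334197998047, "verified": True},
                    {"name": "gen_len", "type": "gen_len", "value": 248.3572, "verified": True},
                ],
            }
        ],
    }
]


def collect_metrics(
    model_index: list[dict[str, Any]], dataset_type: str, split: str
) -> dict[str, float]:
    """Hypothetical helper: return {metric name: value} for one dataset and split."""
    found: dict[str, float] = {}
    for entry in model_index:
        for result in entry.get("results", []):
            dataset = result.get("dataset", {})
            # Only keep results reported for the requested dataset and split.
            if dataset.get("type") == dataset_type and dataset.get("split") == split:
                for metric in result.get("metrics", []):
                    found[metric["name"]] = metric["value"]
    return found


if __name__ == "__main__":
    for name, value in collect_metrics(MODEL_INDEX, "multi_news", "test").items():
        print(f"{name}: {value}")
```

Matching on both the dataset type and the split keeps the lookup unambiguous if later evaluation commits append results for other datasets or splits to the same `model-index` entry.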