autoevaluator HF staff committed on
Commit
926b247
1 Parent(s): bdda27d

Add evaluation results on the document config and test split of ccdv/arxiv-summarization

Beep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the document config and test split of the [ccdv/arxiv-summarization](https://huggingface.co/datasets/ccdv/arxiv-summarization) dataset by @pszemraj, using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-eval-ccdv__arxiv-summarization-document-dcd037-2375274516).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=ccdv/arxiv-summarization).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=ccdv/arxiv-summarization).

Files changed (1)
  1. README.md +33 -0
README.md CHANGED
@@ -90,6 +90,39 @@ model-index:
       type: gen_len
       value: 291.8158
       verified: true
+  - task:
+      type: summarization
+      name: Summarization
+    dataset:
+      name: ccdv/arxiv-summarization
+      type: ccdv/arxiv-summarization
+      config: document
+      split: test
+    metrics:
+    - name: ROUGE-1
+      type: rouge
+      value: 36.3225
+      verified: true
+    - name: ROUGE-2
+      type: rouge
+      value: 9.3743
+      verified: true
+    - name: ROUGE-L
+      type: rouge
+      value: 19.8396
+      verified: true
+    - name: ROUGE-LSUM
+      type: rouge
+      value: 32.2532
+      verified: true
+    - name: loss
+      type: loss
+      value: 2.146871566772461
+      verified: true
+    - name: gen_len
+      type: gen_len
+      value: 186.2966
+      verified: true
 ---
 
 # long-t5-tglobal-xl + BookSum
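
For reviewers who want to sanity-check the added values before accepting, the new `model-index` entry can be mirrored as plain data and printed. This is only a minimal sketch: the `ENTRY` dict is a hand-copied, hypothetical in-memory version of the diff above, and `format_metrics` is an illustrative helper, not part of any Hugging Face library.

```python
# Hypothetical in-memory copy of the model-index entry added in this diff.
# The dict layout and the helper name are illustrative assumptions; in practice
# the data would come from parsing the README.md front matter.
ENTRY = {
    "task": {"type": "summarization", "name": "Summarization"},
    "dataset": {
        "name": "ccdv/arxiv-summarization",
        "type": "ccdv/arxiv-summarization",
        "config": "document",
        "split": "test",
    },
    "metrics": [
        {"name": "ROUGE-1", "type": "rouge", "value": 36.3225, "verified": True},
        {"name": "ROUGE-2", "type": "rouge", "value": 9.3743, "verified": True},
        {"name": "ROUGE-L", "type": "rouge", "value": 19.8396, "verified": True},
        {"name": "ROUGE-LSUM", "type": "rouge", "value": 32.2532, "verified": True},
        {"name": "loss", "type": "loss", "value": 2.146871566772461, "verified": True},
        {"name": "gen_len", "type": "gen_len", "value": 186.2966, "verified": True},
    ],
}


def format_metrics(entry, verified_only=True):
    """Return one 'name: value' string per metric, rounded to 4 decimals."""
    lines = []
    for metric in entry["metrics"]:
        # Skip unverified metrics when only verified results are requested.
        if verified_only and not metric.get("verified", False):
            continue
        lines.append(f"{metric['name']}: {metric['value']:.4f}")
    return lines


if __name__ == "__main__":
    dataset = ENTRY["dataset"]
    print(f"{dataset['name']} ({dataset['config']}, {dataset['split']})")
    for line in format_metrics(ENTRY):
        print("  " + line)
```

Keeping the entry as plain data makes it easy to compare against the predictions dataset linked in the pull request description.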