autoevaluator HF staff commited on
Commit
4575bec
1 Parent(s): c5121e4

Add evaluation results on the 3.0.0 config and test split of cnn_dailymail

Browse files

Beep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the 3.0.0 config and test split of the [cnn_dailymail](https://huggingface.co/datasets/cnn_dailymail) dataset by @samuelallen123, using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-eval-cnn_dailymail-3.0.0-6f9c29-1531855204).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=cnn_dailymail).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=cnn_dailymail).

Files changed (1) hide show
  1. README.md +33 -0
README.md CHANGED
@@ -41,6 +41,39 @@ model-index:
41
  type: gen_len
42
  value: 78.5866
43
  verified: true
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
44
  ---
45
  # BART (large-sized model), fine-tuned on CNN Daily Mail
46
 
 
41
  type: gen_len
42
  value: 78.5866
43
  verified: true
44
+ - task:
45
+ type: summarization
46
+ name: Summarization
47
+ dataset:
48
+ name: cnn_dailymail
49
+ type: cnn_dailymail
50
+ config: 3.0.0
51
+ split: test
52
+ metrics:
53
+ - name: ROUGE-1
54
+ type: rouge
55
+ value: 44.0558
56
+ verified: true
57
+ - name: ROUGE-2
58
+ type: rouge
59
+ value: 21.068
60
+ verified: true
61
+ - name: ROUGE-L
62
+ type: rouge
63
+ value: 30.6209
64
+ verified: true
65
+ - name: ROUGE-LSUM
66
+ type: rouge
67
+ value: 41.0126
68
+ verified: true
69
+ - name: loss
70
+ type: loss
71
+ value: 2.7001595497131348
72
+ verified: true
73
+ - name: gen_len
74
+ type: gen_len
75
+ value: 84.6869
76
+ verified: true
77
  ---
78
  # BART (large-sized model), fine-tuned on CNN Daily Mail
79