lewtun HF staff committed on
Commit
0ae8963
1 Parent(s): dacf355

Add evaluation results on the default config of xsum


Beep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the default config of the [xsum](https://huggingface.co/datasets/xsum) dataset by @sysresearch101, using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-staging-eval-project-xsum-d7ddcd7b-12845710).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=xsum).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=xsum).

Files changed (1)
  1. README.md +34 -1
README.md CHANGED
@@ -14,7 +14,7 @@ model-index:
       type: summarization
       name: Summarization
     dataset:
-      name: xsum & cnn_dailymail
+      name: xsum & cnn_dailymail
       type: xsum & cnn_dailymail
       config: 3.0.0
       split: train
@@ -43,6 +43,39 @@ model-index:
       type: gen_len
       value: <TODO>
       verified: true
+  - task:
+      type: summarization
+      name: Summarization
+    dataset:
+      name: xsum
+      type: xsum
+      config: default
+      split: test
+    metrics:
+    - name: ROUGE-1
+      type: rouge
+      value: 36.7656
+      verified: true
+    - name: ROUGE-2
+      type: rouge
+      value: 14.6898
+      verified: true
+    - name: ROUGE-L
+      type: rouge
+      value: 30.0646
+      verified: true
+    - name: ROUGE-LSUM
+      type: rouge
+      value: 30.0563
+      verified: true
+    - name: loss
+      type: loss
+      value: 1.6373405456542969
+      verified: true
+    - name: gen_len
+      type: gen_len
+      value: 18.6054
+      verified: true
 ---
 
 # T5-large Summarization Model Trained on the combined XSUM-CNN Daily Mail Dataset
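
For readers who want to see how the added `model-index` entry is structured, here is a minimal Python sketch that assembles the same task, dataset, and metrics layout from the values in this diff. The helper name `build_result_entry` and the rule for deriving each metric's `type` are assumptions made for illustration; this is not the evaluator bot's actual code.

```python
def build_result_entry(dataset, config, split, metrics):
    """Assemble one model-index result entry as a plain dict.

    Hypothetical helper: it mirrors the layout of the entry added in
    this diff, not the evaluator bot's implementation.
    """
    return {
        "task": {"type": "summarization", "name": "Summarization"},
        "dataset": {
            "name": dataset,
            "type": dataset,
            "config": config,
            "split": split,
        },
        "metrics": [
            {
                "name": name,
                # Assumed rule: ROUGE variants share the type "rouge";
                # other metrics use their own name as the type.
                "type": "rouge" if name.startswith("ROUGE") else name,
                "value": value,
                "verified": True,
            }
            for name, value in metrics.items()
        ],
    }


# Metric values copied from the lines this diff adds to README.md.
xsum_metrics = {
    "ROUGE-1": 36.7656,
    "ROUGE-2": 14.6898,
    "ROUGE-L": 30.0646,
    "ROUGE-LSUM": 30.0563,
    "loss": 1.6373405456542969,
    "gen_len": 18.6054,
}

entry = build_result_entry("xsum", "default", "test", xsum_metrics)

# Print each metric name with its recorded value.
for metric in entry["metrics"]:
    print(f'{metric["name"]}: {metric["value"]}')
```

Keeping the entry as a plain dict makes it straightforward to serialize with whichever YAML library you prefer before writing it into the README front matter.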