lewtun HF staff commited on
Commit
9fc3e47
1 Parent(s): 2e4f613

Add evaluation results on the default config of xsum

Browse files

Beep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the default config of the [xsum](https://huggingface.co/datasets/xsum) dataset by

@sysresearch101

, using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-staging-eval-project-xsum-d7ddcd7b-12845708).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=xsum).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=xsum).

Files changed (1) hide show
  1. README.md +35 -2
README.md CHANGED
@@ -13,8 +13,8 @@ model-index:
13
  type: summarization
14
  name: Summarization
15
  dataset:
16
- name: xsum
17
- type: xsum
18
  config: 3.0.0
19
  split: train
20
  metrics:
@@ -42,6 +42,39 @@ model-index:
42
  type: gen_len
43
  value: <TODO>
44
  verified: true
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
45
  ---
46
 
47
  # T5-large Summarization Model Trained on the XSUM Dataset
 
13
  type: summarization
14
  name: Summarization
15
  dataset:
16
+ name: xsum
17
+ type: xsum
18
  config: 3.0.0
19
  split: train
20
  metrics:
 
42
  type: gen_len
43
  value: <TODO>
44
  verified: true
45
+ - task:
46
+ type: summarization
47
+ name: Summarization
48
+ dataset:
49
+ name: xsum
50
+ type: xsum
51
+ config: default
52
+ split: test
53
+ metrics:
54
+ - name: ROUGE-1
55
+ type: rouge
56
+ value: 26.8921
57
+ verified: true
58
+ - name: ROUGE-2
59
+ type: rouge
60
+ value: 6.9411
61
+ verified: true
62
+ - name: ROUGE-L
63
+ type: rouge
64
+ value: 21.2832
65
+ verified: true
66
+ - name: ROUGE-LSUM
67
+ type: rouge
68
+ value: 21.284
69
+ verified: true
70
+ - name: loss
71
+ type: loss
72
+ value: 2.5411810874938965
73
+ verified: true
74
+ - name: gen_len
75
+ type: gen_len
76
+ value: 18.7755
77
+ verified: true
78
  ---
79
 
80
  # T5-large Summarization Model Trained on the XSUM Dataset