lewtun HF staff committed on
Commit acea7af (1 parent: de903d9)

Add evaluation results on the plain_text config and validation split of launch/gov_report

Beep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the plain_text config and validation split of the [launch/gov_report](https://huggingface.co/datasets/launch/gov_report) dataset by @nonchalant-nagavalli, using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-staging-eval-launch__gov_report-plain_text-7b7f8a-16126221).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=launch/gov_report).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=launch/gov_report).

Files changed (1)
  1. README.md +33 -0
README.md CHANGED
@@ -78,6 +78,39 @@ model-index:
       type: gen_len
       value: 186.2003
       verified: true
+  - task:
+      type: summarization
+      name: Summarization
+    dataset:
+      name: launch/gov_report
+      type: launch/gov_report
+      config: plain_text
+      split: validation
+    metrics:
+    - name: ROUGE-1
+      type: rouge
+      value: 26.6836
+      verified: true
+    - name: ROUGE-2
+      type: rouge
+      value: 3.937
+      verified: true
+    - name: ROUGE-L
+      type: rouge
+      value: 15.2322
+      verified: true
+    - name: ROUGE-LSUM
+      type: rouge
+      value: 23.5209
+      verified: true
+    - name: loss
+      type: loss
+      value: 7.046688556671143
+      verified: true
+    - name: gen_len
+      type: gen_len
+      value: 183.2895
+      verified: true
 ---
 
 # BigBirdPegasus model (large)
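
The entry this commit adds to `model-index` is ordinary YAML, so once the README front matter is parsed it can be inspected like any nested dictionary. The following is a minimal sketch under that assumption: `added_result` is a hand-copied, hypothetical in-memory version of the new entry (not loaded from the Hub), and the helper names `metrics_by_name` and `all_verified` are illustrative, not part of any Hugging Face API.

```python
# Hypothetical in-memory copy of the results entry added in this commit.
# In practice this would come from parsing the YAML front matter of README.md.
added_result = {
    "task": {"type": "summarization", "name": "Summarization"},
    "dataset": {
        "name": "launch/gov_report",
        "type": "launch/gov_report",
        "config": "plain_text",
        "split": "validation",
    },
    "metrics": [
        {"name": "ROUGE-1", "type": "rouge", "value": 26.6836, "verified": True},
        {"name": "ROUGE-2", "type": "rouge", "value": 3.937, "verified": True},
        {"name": "ROUGE-L", "type": "rouge", "value": 15.2322, "verified": True},
        {"name": "ROUGE-LSUM", "type": "rouge", "value": 23.5209, "verified": True},
        {"name": "loss", "type": "loss", "value": 7.046688556671143, "verified": True},
        {"name": "gen_len", "type": "gen_len", "value": 183.2895, "verified": True},
    ],
}


def metrics_by_name(result):
    """Map each metric name in a model-index result entry to its value."""
    return {metric["name"]: metric["value"] for metric in result["metrics"]}


def all_verified(result):
    """Return True if every metric in the entry carries verified: true."""
    return all(metric.get("verified", False) for metric in result["metrics"])


scores = metrics_by_name(added_result)
for name, value in scores.items():
    print(f"{name}: {value}")
```

Keying the metrics by name makes it straightforward to compare this entry against the other result entries already present in the same `model-index` block.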