autoevaluator HF staff commited on
Commit
15bfbb6
1 Parent(s): 2b0a387

Add evaluation results on the default config and test split of billsum

Browse files

Beep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the default config and test split of the [billsum](https://huggingface.co/datasets/billsum) dataset by @Artifact-AI, using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-eval-billsum-default-8a9925-2249471725).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=billsum).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=billsum).

Files changed (1) hide show
  1. README.md +36 -0
README.md CHANGED
@@ -113,6 +113,42 @@ widget:
113
  medical parole hearing or medical parole release is scheduled for an inmate receiving
114
  medical parole consideration, regardless of whether the inmate is sentenced either
115
  determinately or indeterminately.'
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
116
  ---
117
 
118
  # Longformer Encoder-Decoder (LED) fine-tuned on Billsum
 
113
  medical parole hearing or medical parole release is scheduled for an inmate receiving
114
  medical parole consideration, regardless of whether the inmate is sentenced either
115
  determinately or indeterminately.'
116
+ model-index:
117
+ - name: Artifact-AI/led_base_16384_billsum_summarization
118
+ results:
119
+ - task:
120
+ type: summarization
121
+ name: Summarization
122
+ dataset:
123
+ name: billsum
124
+ type: billsum
125
+ config: default
126
+ split: test
127
+ metrics:
128
+ - name: ROUGE-1
129
+ type: rouge
130
+ value: 47.6721
131
+ verified: true
132
+ - name: ROUGE-2
133
+ type: rouge
134
+ value: 26.7372
135
+ verified: true
136
+ - name: ROUGE-L
137
+ type: rouge
138
+ value: 34.5681
139
+ verified: true
140
+ - name: ROUGE-LSUM
141
+ type: rouge
142
+ value: 41.5295
143
+ verified: true
144
+ - name: loss
145
+ type: loss
146
+ value: 2.079916000366211
147
+ verified: true
148
+ - name: gen_len
149
+ type: gen_len
150
+ value: 128.8152
151
+ verified: true
152
  ---
153
 
154
  # Longformer Encoder-Decoder (LED) fine-tuned on Billsum