pszemraj committed on
Commit
76b5b35
1 Parent(s): 3232aed
Files changed (1)
  1. README.md +35 -16
README.md CHANGED
@@ -4,13 +4,6 @@ datasets:
4
  - pszemraj/scientific_lay_summarisation-plos-norm
5
  language:
6
  - en
7
- pipeline_tag: summarization
8
- tags:
9
- - lay summaries
10
- - paper summaries
11
- - biology
12
- - medical
13
- library_name: transformers
14
  widget:
15
  - text: large earthquakes along a given fault segment do not occur at random intervals
16
  because it takes time to accumulate the strain energy for the rupture. The rates
@@ -174,15 +167,22 @@ widget:
174
  example_title: Richard & Mortimer
175
  - text: "The tower is 324 metres (1,063 ft) tall, about the same height as an 81-storey building, and the tallest structure in Paris. Its base is square, measuring 125 metres (410 ft) on each side. During its construction, the Eiffel Tower surpassed the Washington Monument to become the tallest man-made structure in the world, a title it held for 41 years until the Chrysler Building in New York City was finished in 1930. It was the first structure to reach a height of 300 metres. Due to the addition of a broadcasting aerial at the top of the tower in 1957, it is now taller than the Chrysler Building by 5.2 metres (17 ft). Excluding transmitters, the Eiffel Tower is the second tallest free-standing structure in France after the Millau Viaduct."
176
  example_title: eiffel
177
- parameters:
178
- max_length: 64
179
- min_length: 8
180
- no_repeat_ngram_size: 3
181
- early_stopping: true
182
- repetition_penalty: 3.5
183
- encoder_no_repeat_ngram_size: 4
184
- length_penalty: 0.4
185
- num_beams: 4
186
  ---
187
 
188
  # long-t5-tglobal-base-sci-simplify
@@ -207,6 +207,25 @@ It achieves the following results on the evaluation set:
207
 
208
  - Ability to generalize outside of the dataset domain (pubmed/bioscience type papers) has to be evaluated.
209
 
210
  ## Training procedure
211
 
212
  ### Training hyperparameters
 
4
  - pszemraj/scientific_lay_summarisation-plos-norm
5
  language:
6
  - en
7
  widget:
8
  - text: large earthquakes along a given fault segment do not occur at random intervals
9
  because it takes time to accumulate the strain energy for the rupture. The rates
 
167
  example_title: Richard & Mortimer
168
  - text: "The tower is 324 metres (1,063 ft) tall, about the same height as an 81-storey building, and the tallest structure in Paris. Its base is square, measuring 125 metres (410 ft) on each side. During its construction, the Eiffel Tower surpassed the Washington Monument to become the tallest man-made structure in the world, a title it held for 41 years until the Chrysler Building in New York City was finished in 1930. It was the first structure to reach a height of 300 metres. Due to the addition of a broadcasting aerial at the top of the tower in 1957, it is now taller than the Chrysler Building by 5.2 metres (17 ft). Excluding transmitters, the Eiffel Tower is the second tallest free-standing structure in France after the Millau Viaduct."
169
  example_title: eiffel
170
+ parameters:
171
+ max_length: 64
172
+ min_length: 8
173
+ no_repeat_ngram_size: 3
174
+ early_stopping: true
175
+ repetition_penalty: 3.5
176
+ encoder_no_repeat_ngram_size: 4
177
+ length_penalty: 0.4
178
+ num_beams: 4
179
+ pipeline_tag: summarization
180
+ tags:
181
+ - lay summaries
182
+ - paper summaries
183
+ - biology
184
+ - medical
185
+ library_name: transformers
186
  ---
187
 
188
  # long-t5-tglobal-base-sci-simplify
 
207
 
208
  - Ability to generalize outside of the dataset domain (pubmed/bioscience type papers) has to be evaluated.
209
 
210
+
211
+ ## Usage
212
+
213
+ It's recommended to use this model with [beam search decoding](https://huggingface.co/docs/transformers/generation_strategies#beamsearch-decoding). If you prefer, the `textsum` utility package abstracts most of this away:
214
+
215
+
216
+ ```bash
217
+ pip install -U textsum
218
+ ```
219
+
220
+ ```python
221
+ from textsum.summarize import Summarizer
222
+
223
+ summarizer = Summarizer('pszemraj/long-t5-tglobal-base-sci-simplify')
224
+ text = "put the text you don't want to read here"
225
+ summary = summarizer.summarize_string(text)
226
+ print(summary)
227
+ ```
228
+
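For readers who would rather call `transformers` directly, beam search decoding could be sketched roughly as follows. This is a minimal sketch, not part of the commit: it assumes the values in the `parameters` block of the card metadata are reasonable decoding settings to pass to `generate`, and the names `GENERATION_KWARGS` and `summarize` are hypothetical helpers introduced only for illustration.

```python
MODEL_NAME = "pszemraj/long-t5-tglobal-base-sci-simplify"

# Decoding settings copied from the `parameters` block in the card metadata.
# num_beams > 1 is what switches generation to beam search.
GENERATION_KWARGS = {
    "max_length": 64,
    "min_length": 8,
    "no_repeat_ngram_size": 3,
    "early_stopping": True,
    "repetition_penalty": 3.5,
    "encoder_no_repeat_ngram_size": 4,
    "length_penalty": 0.4,
    "num_beams": 4,
}


def summarize(text, model_name=MODEL_NAME):
    """Return a beam-search summary of `text` (hypothetical helper)."""
    # Imported lazily so the settings above can be inspected
    # without transformers or torch installed.
    from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModelForSeq2SeqLM.from_pretrained(model_name)

    # Tokenize to PyTorch tensors (input_ids and attention_mask).
    inputs = tokenizer(text, return_tensors="pt", truncation=True)
    output_ids = model.generate(**inputs, **GENERATION_KWARGS)

    # generate() returns a batch; decode the single sequence in it.
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `summarize("put the text you don't want to read here")` downloads the checkpoint on first use. Note that `max_length: 64` caps the summary length, so longer outputs would need a larger value.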
229
  ## Training procedure
230
 
231
  ### Training hyperparameters