add usage
README.md
CHANGED
@@ -4,13 +4,6 @@ datasets:
|
|
4 |
- pszemraj/scientific_lay_summarisation-plos-norm
|
5 |
language:
|
6 |
- en
|
7 |
-
pipeline_tag: summarization
|
8 |
-
tags:
|
9 |
-
- lay summaries
|
10 |
-
- paper summaries
|
11 |
-
- biology
|
12 |
-
- medical
|
13 |
-
library_name: transformers
|
14 |
widget:
|
15 |
- text: large earthquakes along a given fault segment do not occur at random intervals
|
16 |
because it takes time to accumulate the strain energy for the rupture. The rates
|
@@ -174,15 +167,22 @@ widget:
|
|
174 |
example_title: Richard & Mortimer
|
175 |
- text: "The tower is 324 metres (1,063 ft) tall, about the same height as an 81-storey building, and the tallest structure in Paris. Its base is square, measuring 125 metres (410 ft) on each side. During its construction, the Eiffel Tower surpassed the Washington Monument to become the tallest man-made structure in the world, a title it held for 41 years until the Chrysler Building in New York City was finished in 1930. It was the first structure to reach a height of 300 metres. Due to the addition of a broadcasting aerial at the top of the tower in 1957, it is now taller than the Chrysler Building by 5.2 metres (17 ft). Excluding transmitters, the Eiffel Tower is the second tallest free-standing structure in France after the Millau Viaduct."
|
176 |
example_title: eiffel
|
177 |
-
|
178 |
-
|
179 |
-
|
180 |
-
|
181 |
-
|
182 |
-
|
183 |
-
|
184 |
-
|
185 |
-
|
186 |
---
|
187 |
|
188 |
# long-t5-tglobal-base-sci-simplify
|
@@ -207,6 +207,25 @@ It achieves the following results on the evaluation set:
|
|
207 |
|
208 |
- Ability to generalize outside of the dataset domain (pubmed/bioscience type papers) has to be evaluated.
|
209 |
|
210 |
## Training procedure
|
211 |
|
212 |
### Training hyperparameters
|
|
|
4 |
- pszemraj/scientific_lay_summarisation-plos-norm
|
5 |
language:
|
6 |
- en
|
7 |
widget:
|
8 |
- text: large earthquakes along a given fault segment do not occur at random intervals
|
9 |
because it takes time to accumulate the strain energy for the rupture. The rates
|
|
|
167 |
example_title: Richard & Mortimer
|
168 |
- text: "The tower is 324 metres (1,063 ft) tall, about the same height as an 81-storey building, and the tallest structure in Paris. Its base is square, measuring 125 metres (410 ft) on each side. During its construction, the Eiffel Tower surpassed the Washington Monument to become the tallest man-made structure in the world, a title it held for 41 years until the Chrysler Building in New York City was finished in 1930. It was the first structure to reach a height of 300 metres. Due to the addition of a broadcasting aerial at the top of the tower in 1957, it is now taller than the Chrysler Building by 5.2 metres (17 ft). Excluding transmitters, the Eiffel Tower is the second tallest free-standing structure in France after the Millau Viaduct."
|
169 |
example_title: eiffel
|
170 |
+
parameters:
|
171 |
+
max_length: 64
|
172 |
+
min_length: 8
|
173 |
+
no_repeat_ngram_size: 3
|
174 |
+
early_stopping: true
|
175 |
+
repetition_penalty: 3.5
|
176 |
+
encoder_no_repeat_ngram_size: 4
|
177 |
+
length_penalty: 0.4
|
178 |
+
num_beams: 4
|
179 |
+
pipeline_tag: summarization
|
180 |
+
tags:
|
181 |
+
- lay summaries
|
182 |
+
- paper summaries
|
183 |
+
- biology
|
184 |
+
- medical
|
185 |
+
library_name: transformers
|
186 |
---
|
187 |
|
188 |
# long-t5-tglobal-base-sci-simplify
|
|
|
207 |
|
208 |
- Ability to generalize outside of the dataset domain (pubmed/bioscience type papers) has to be evaluated.
|
209 |
|
210 |
+
|
211 |
+
## Usage
|
212 |
+
|
213 |
+
It's recommended to use this model with [beam search decoding](https://huggingface.co/docs/transformers/generation_strategies#beamsearch-decoding). If you'd rather not configure generation yourself, the `textsum` util repo abstracts most of this away:
|
214 |
+
|
215 |
+
|
216 |
+
```bash
|
217 |
+
pip install -U textsum
|
218 |
+
```
|
219 |
+
|
220 |
+
```python
|
221 |
+
from textsum.summarize import Summarizer
|
222 |
+
|
223 |
+
summarizer = Summarizer('pszemraj/long-t5-tglobal-base-sci-simplify')
|
224 |
+
text = "put the text you don't want to read here"
|
225 |
+
summary = summarizer.summarize_string(text)
|
226 |
+
print(summary)
|
227 |
+
```
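
The beam search recommendation above can also be followed without `textsum`, by calling the `transformers` summarization pipeline directly. The following is a minimal sketch, assuming the values in the widget `parameters` block added in this commit are a sensible starting point. They are sized for short widget outputs, so `max_length` will likely need raising for full papers. `GENERATION_KWARGS` and `summarize` are illustrative names, not part of the model repo or of `textsum`.

```python
# Generation settings copied from the widget `parameters` block in the model card.
# Assumption: these widget defaults are only a starting point for real use.
GENERATION_KWARGS = {
    "max_length": 64,
    "min_length": 8,
    "no_repeat_ngram_size": 3,
    "early_stopping": True,
    "repetition_penalty": 3.5,
    "encoder_no_repeat_ngram_size": 4,
    "length_penalty": 0.4,
    "num_beams": 4,  # num_beams > 1 enables beam search decoding
}


def summarize(text: str, **overrides) -> str:
    """Summarize `text` with beam search; keyword overrides replace the defaults."""
    # Imported lazily so the settings above can be inspected without loading transformers.
    from transformers import pipeline

    summarizer = pipeline(
        "summarization",
        model="pszemraj/long-t5-tglobal-base-sci-simplify",
    )
    params = {**GENERATION_KWARGS, **overrides}
    # The summarization pipeline returns a list with one dict per input text.
    result = summarizer(text, **params)
    return result[0]["summary_text"]
```

Calling `summarize(text)` downloads the model weights on first use. Individual settings can be overridden per call, for example `summarize(text, max_length=512)`.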
|
228 |
+
|
229 |
## Training procedure
|
230 |
|
231 |
### Training hyperparameters
|