krotima1/mbart-at2h-s

Text2Text Generation

abstractive summarization


Marian Krotil committed on May 23, 2022

Commit fa24e9e (1 parent: c078932)

Update README.md

Files changed (1)

README.md +2 -2

@@ -20,7 +20,7 @@ This model is a fine-tuned checkpoint of [facebook/mbart-large-cc25](https://hug
 The model deals with the task ``Abstract + Text to Headline`` (AT2H) which consists in generating a one- or two-sentence summary considered as a headline from a Czech news text.
 ## Dataset
-The model has been trained on the [SumeCzech](https://ufal.mff.cuni.cz/sumeczech) dataset. The dataset includes around 1M Czech news-based documents consisting of a Headline, Abstract, and Full-text sections. Truncation and padding were configured for 512 tokens.
+The model has been trained on the [SumeCzech](https://ufal.mff.cuni.cz/sumeczech) dataset. The dataset includes around 1M Czech news-based documents consisting of a Headline, Abstract, and Full-text sections. Truncation and padding were configured for 512 tokens for the encoder and 64 for the decoder.
 ## Training
 The model has been trained on 1x NVIDIA Tesla A100 40GB for 40 hours. During training, the model has seen 2576K documents corresponding to roughly 3 epochs.
@@ -41,7 +41,7 @@ def summ_config():
             ("repetition_penalty", 1.2),
             ("no_repeat_ngram_size", None),
             ("early_stopping", True),
-            ("max_length", 96),
+            ("max_length", 64),
             ("min_length", 10),
         ])),
         #texts to summarize
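
Both changes in this commit concern sequence lengths: the README now states a 64-token decoder limit alongside the 512-token encoder limit, and the generation `max_length` drops from 96 to 64, which appears to bring it in line with that decoder limit. The following is a minimal sketch of what truncating and padding to fixed lengths means, not the author's preprocessing code. The helper `fit_to_length`, the constant names, and the `PAD_ID` value are hypothetical and used only for illustration. The generation settings are limited to the entries visible in the diff.

```python
from collections import OrderedDict

ENCODER_MAX_LEN = 512  # encoder truncation/padding length stated in the README
DECODER_MAX_LEN = 64   # decoder truncation/padding length stated in the README
PAD_ID = 1             # assumption: placeholder padding id, for illustration only


def fit_to_length(token_ids, max_len, pad_id=PAD_ID):
    """Truncate to max_len tokens, then right-pad with pad_id up to max_len."""
    clipped = list(token_ids[:max_len])
    return clipped + [pad_id] * (max_len - len(clipped))


# Generation settings visible in the diff; other entries of the README
# config are not shown there and are omitted here.
generation_kwargs = OrderedDict([
    ("repetition_penalty", 1.2),
    ("no_repeat_ngram_size", None),
    ("early_stopping", True),
    ("max_length", DECODER_MAX_LEN),  # changed from 96 to 64 in this commit
    ("min_length", 10),
])

# A 600-token input is truncated to 512; a 20-token target is padded to 64.
encoder_ids = fit_to_length(range(600), ENCODER_MAX_LEN)
decoder_ids = fit_to_length(range(20), DECODER_MAX_LEN)
```

With a generation limit above the decoder length seen in training, the model could be asked to produce sequences longer than any target it was trained on, which is one plausible reason for the change.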