---
language: en
tags:
- abstractive summarization
model-index:
- name: kubershahi/pegasus-inshorts
  results:
  - task:
      type: abstractive summarization
      name: abstractive summarization
    dataset:
      name: inshorts
      type: inshorts
      config: inshorts
      split: train
    metrics:
    - name: ROUGE-1
      type: rouge
      value: 4.2525
      verified: true
    - name: ROUGE-2
      type: rouge
      value: 4.2525
      verified: true
    - name: ROUGE-L
      type: rouge
      value: 17.4469
      verified: true
    - name: ROUGE-LSUM
      type: rouge
      value: 18.8907
      verified: true
    - name: loss
      type: loss
      value: 3.0317161083221436
      verified: true
    - name: gen_len
      type: gen_len
      value: 20.3122
      verified: true
---
# Problem Statement
Given a news article, generate a two-to-three-sentence summary and a headline for the article. The summary should be abstractive rather than extractive.
In abstractive summarization, new sentences are generated for the summary, so its sentences need not appear verbatim in the news article.
# Model Description
This model builds on the [google/pegasus-large](https://huggingface.co/google/pegasus-large) model by finetuning it on a custom summary-headline dataset called [inshorts](https://github.com/kubershahi/ashoka-aml/blob/master/dataset/news_headline.csv).
To generate a headline for an article, first obtain a summary of the article from the pegasus-large model, then pass that summary through this model.
This two-stage approach was taken to derive an apt headline from the summary, rather than generating the headline directly with pegasus-large.
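
The two-stage flow described above could be sketched roughly as follows. This is a minimal illustration, not the project's actual inference code: it assumes the `transformers` library with a PyTorch backend, takes the model ID `kubershahi/pegasus-inshorts` from the metadata of this card, and uses hypothetical helper names (`load_seq2seq`, `generate_headline`) and an illustrative `max_new_tokens` value.

```python
from typing import Callable, Tuple


def load_seq2seq(model_name: str, max_new_tokens: int = 64) -> Callable[[str], str]:
    """Wrap a Hugging Face seq2seq checkpoint as a plain text -> text function.

    Hypothetical helper; max_new_tokens is an illustrative default, not a tuned value.
    """
    # Imported lazily so the rest of the sketch works without transformers installed.
    from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModelForSeq2SeqLM.from_pretrained(model_name)

    def run(text: str) -> str:
        # Truncate inputs that exceed the model's maximum input length.
        inputs = tokenizer(text, truncation=True, return_tensors="pt")
        output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
        return tokenizer.decode(output_ids[0], skip_special_tokens=True)

    return run


def generate_headline(
    article: str,
    summarize: Callable[[str], str],
    headline: Callable[[str], str],
) -> Tuple[str, str]:
    """Stage 1: summarize the article. Stage 2: turn the summary into a headline."""
    summary = summarize(article)
    return summary, headline(summary)


# Example usage (downloads both checkpoints on first run):
#   summarize = load_seq2seq("google/pegasus-large")
#   headline = load_seq2seq("kubershahi/pegasus-inshorts")
#   summary, title = generate_headline(article_text, summarize, headline)
```

Passing the two stages in as plain functions keeps the pipeline independent of how each model is loaded, so either stage can be swapped or stubbed out.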
For more details about the project, click [here](https://github.com/kubershahi/ashoka-aml).