Marc
/

pegasus_xsum_gigaword

Text2Text Generation

Inference Endpoints

Model card Files Files and versions Community

pegasus_xsum_gigaword / README.md

Marc's picture

Update README.md

e7c9ae7 over 3 years ago

|

No virus

1.96 kB

	---
	language:
	- English
	-
	thumbnail:
	tags:
	-
	-
	-
	license:
	datasets:
	- XSUM
	- Gigaword
	metrics:
	- Rouge
	-
	---

	# Pegasus XSUM Gigaword

	## Model description

	Pegasus XSUM model finetuned to Gigaword Summarization task, significantly better performance than pegasus gigaword, but still doesn't match model paper performance.

	## Intended uses & limitations
	Produces short summaries with the coherence of the XSUM Model
	#### How to use

	```python
	# You can include sample code which will be formatted
	```

	#### Limitations and bias

	Still has all the biases of any of the abstractive models, but seems a little less prone to hallucination.
	## Training data

	Initialized with pegasus-XSUM

	## Training procedure

	Trained for 11500 iterations on Gigaword corpus using OOB seq2seq (from hugging face using the default parameters)

	## Eval results
	Evaluated on Gigaword test set (from hugging face using the default parameters)
	run_summarization.py --model_name_or_path pegasus-xsum/checkpoint-11500/ --do_predict --dataset_name gigaword --dataset_config "3.0.0" --source_prefix "summarize: " --output_dir pegasus-xsum --per_device_train_batch_size=8 --per_device_eval_batch_size=8 --overwrite_output_dir --predict_with_generate

	\| Metric \| Score \|
	\| ----------- \| ----------- \|
	\| eval_rouge1 \| 34.1958 \|
	\| eval_rouge2 \| 15.4033 \|
	\| eval_rougeL \| 31.4488 \|


	run_summarization.py --model_name_or_path google/pegasus-gigaword --do_predict --dataset_name gigaword --dataset_config "3.0.0" --source_prefix "summarize: " --output_dir pegasus-xsum --per_device_train_batch_size=8 --per_device_eval_batch_size=8 --overwrite_output_dir --predict_with_generate

	\| Metric \| Score \|
	\| ----------- \| ----------- \|
	\| eval_rouge1 \| 20.8111 \|
	\| eval_rouge2 \| 8.766 \|
	\| eval_rougeL \| 18.4431 \|


	### BibTeX entry and citation info

	```bibtex
	@inproceedings{...,
	year={2020}
	}
	```