walkerrose
/

cv_summarization-distilbart-cnn-16-6

text2text-generation

generated_from_keras_callback

Inference Endpoints

Model card Files Files and versions Community

cv_summarization-distilbart-cnn-16-6 / README.md

walkerrose's picture

Update README.md

8038c25 about 1 year ago

|

history blame contribute delete

No virus

1.55 kB

	---
	license: apache-2.0
	tags:
	- generated_from_keras_callback
	- summarization
	model-index:
	- name: cv_summarization-distilbart-cnn-16-6
	results: []
	language:
	- en
	---

	<!-- This model card has been generated automatically according to the information Keras had access to. You should
	probably proofread and complete it, then remove this comment. -->

	# cv_summarization-distilbart-cnn-16-6

	This model is a fine-tuned version of [philschmid/tf-distilbart-cnn-12-6](https://huggingface.co/philschmid/tf-distilbart-cnn-12-6) on [cnn_dailymail](https://huggingface.co/datasets/cnn_dailymail) dataset.
	It achieves the following results on the evaluation set:
	- Train Loss: 0.1445
	- Validation Loss: 0.4524
	- Epoch: 4
	- Rouge1: 76.90
	- Rouge2: 73.30
	- RougeL: 76.04
	- RougeLsum: 75.87

	## Model description

	More information needed

	## Intended uses & limitations

	More information needed

	## Training and evaluation data

	More information needed

	## Training procedure

	### Training hyperparameters

	The following hyperparameters were used during training:
	- optimizer: {'name': 'Adam', 'learning_rate': 2e-05}
	- training_precision: float32

	### Training results

	\| Train Loss \| Validation Loss \| Epoch \|
	\|:----------:\|:---------------:\|:-----:\|
	\| 0.7265 \| 0.4665 \| 0 \|
	\| 0.3192 \| 0.4480 \| 1 \|
	\| 0.2154 \| 0.4395 \| 2 \|
	\| 0.1587 \| 0.4668 \| 3 \|
	\| 0.1445 \| 0.4524 \| 4 \|


	### Framework versions

	- Transformers 4.30.1
	- TensorFlow 2.12.0
	- Datasets 2.12.0
	- Tokenizers 0.13.3