--- license: apache-2.0 tags: - generated_from_keras_callback - summarization model-index: - name: cv_summarization-distilbart-cnn-16-6 results: [] language: - en --- # cv_summarization-distilbart-cnn-16-6 This model is a fine-tuned version of [philschmid/tf-distilbart-cnn-12-6](https://huggingface.co/philschmid/tf-distilbart-cnn-12-6) on [cnn_dailymail](https://huggingface.co/datasets/cnn_dailymail) dataset. It achieves the following results on the evaluation set: - Train Loss: 0.1445 - Validation Loss: 0.4524 - Epoch: 4 - Rouge1: 76.90 - Rouge2: 73.30 - RougeL: 76.04 - RougeLsum: 75.87 ## Model description More information needed ## Intended uses & limitations More information needed ## Training and evaluation data More information needed ## Training procedure ### Training hyperparameters The following hyperparameters were used during training: - optimizer: {'name': 'Adam', 'learning_rate': 2e-05} - training_precision: float32 ### Training results | Train Loss | Validation Loss | Epoch | |:----------:|:---------------:|:-----:| | 0.7265 | 0.4665 | 0 | | 0.3192 | 0.4480 | 1 | | 0.2154 | 0.4395 | 2 | | 0.1587 | 0.4668 | 3 | | 0.1445 | 0.4524 | 4 | ### Framework versions - Transformers 4.30.1 - TensorFlow 2.12.0 - Datasets 2.12.0 - Tokenizers 0.13.3