--- license: apache-2.0 tags: - generated_from_keras_callback model-index: - name: Jammal7/t5-small-finetuned-Big-Patents results: [] --- # Jammal7/t5-small-finetuned-Big-Patents This model is a fine-tuned version of [t5-small](https://huggingface.co/t5-small) on [big_patents](https://huggingface.co/datasets/big_patent) datasets. It achieves the following results on the evaluation set: - Train Loss: 2.1148 - Validation Loss: 2.0812 - Train Rouge1: 23.0058 - Train Rouge2: 12.5142 - Train Rougel: 19.9076 - Train Rougelsum: 20.8720 - Train Gen Len: 19.0 - Epoch: 14 ## Model description More information needed ## Intended uses & limitations More information needed ## Training and evaluation data More information needed ## Training procedure ### Training hyperparameters The following hyperparameters were used during training: - optimizer: {'name': 'AdamWeightDecay', 'learning_rate': 2e-05, 'decay': 0.0, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-07, 'amsgrad': False, 'weight_decay_rate': 0.01} - training_precision: float32 ### Training results | Train Loss | Validation Loss | Train Rouge1 | Train Rouge2 | Train Rougel | Train Rougelsum | Train Gen Len | Epoch | |:----------:|:---------------:|:------------:|:------------:|:------------:|:---------------:|:-------------:|:-----:| | 2.6552 | 2.3220 | 21.3700 | 10.2105 | 17.5338 | 18.8802 | 18.9929 | 0 | | 2.4395 | 2.2535 | 22.2253 | 11.2822 | 18.7238 | 19.9181 | 19.0 | 1 | | 2.3755 | 2.2174 | 22.2327 | 11.5575 | 18.8348 | 19.9379 | 19.0 | 2 | | 2.3329 | 2.1886 | 22.5328 | 11.8225 | 19.2582 | 20.3227 | 19.0 | 3 | | 2.3004 | 2.1681 | 22.7872 | 12.1642 | 19.5194 | 20.5907 | 19.0 | 4 | | 2.2696 | 2.1579 | 22.6969 | 12.0848 | 19.5012 | 20.5564 | 19.0 | 5 | | 2.2487 | 2.1385 | 22.7300 | 12.2347 | 19.5685 | 20.5822 | 19.0 | 6 | | 2.2249 | 2.1347 | 22.7250 | 12.2077 | 19.5103 | 20.5246 | 19.0 | 7 | | 2.2060 | 2.1265 | 22.8921 | 12.4139 | 19.6903 | 20.7126 | 19.0 | 8 | | 2.1892 | 2.1126 | 22.9440 | 12.4268 | 19.7717 | 20.8156 | 19.0 | 9 | | 2.1740 | 2.1084 | 22.9201 | 12.4491 | 19.8107 | 20.8059 | 19.0 | 10 | | 2.1580 | 2.0980 | 22.9947 | 12.3798 | 19.7494 | 20.7852 | 19.0 | 11 | | 2.1414 | 2.0934 | 23.0885 | 12.5899 | 19.9531 | 20.9145 | 19.0 | 12 | | 2.1302 | 2.0809 | 23.0854 | 12.5580 | 19.9468 | 20.8737 | 19.0 | 13 | | 2.1148 | 2.0812 | 23.0058 | 12.5142 | 19.9076 | 20.8720 | 19.0 | 14 | ### Framework versions - Transformers 4.30.1 - TensorFlow 2.12.0 - Datasets 2.12.0 - Tokenizers 0.13.3