metadata

license: apache-2.0
base_model: t5-small
tags:
  - generated_from_trainer
metrics:
  - rouge
model-index:
  - name: t5-small-hardaDerailKP
    results: []

t5-small-hardaDerailKP

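This model is a fine-tuned version of t5-small. The training dataset is not recorded in this card. On the evaluation set it achieves the following results, which match the epoch 4 checkpoint (step 24628), the one with the lowest validation loss: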

Loss: 1.1390
Rouge1: 51.5946
Rouge2: 41.2028
RougeL: 51.4341
RougeLsum: 51.4546
Gen Len: 6.3538
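
As a rough illustration of what the Rouge1 figure measures, the sketch below computes a unigram-overlap F-measure on whitespace tokens and scales it to 0-100, the scale the scores above appear to use. It is a simplification, not the scorer used for this card: the function name `rouge1_f` and the example strings are hypothetical, and real ROUGE implementations apply their own tokenization and optional stemming.

```python
from collections import Counter


def rouge1_f(prediction: str, reference: str) -> float:
    """Simplified ROUGE-1 F-measure on lowercase whitespace tokens, scaled to 0-100."""
    pred_counts = Counter(prediction.lower().split())
    ref_counts = Counter(reference.lower().split())
    # Clipped overlap: a token counts at most as often as it occurs in both texts.
    overlap = sum((pred_counts & ref_counts).values())
    if overlap == 0:
        return 0.0
    precision = overlap / sum(pred_counts.values())
    recall = overlap / sum(ref_counts.values())
    return 100.0 * 2 * precision * recall / (precision + recall)


# Hypothetical strings, not taken from the evaluation set.
score = rouge1_f("the cat sat", "the cat")
print(round(score, 2))  # 80.0
```

With an average generation length of about 6 tokens, a single extra or missing word shifts a per-example score considerably, as the example shows.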

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

learning_rate: 5e-05
train_batch_size: 4
eval_batch_size: 4
seed: 42
optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
lr_scheduler_type: linear
num_epochs: 8
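
With lr_scheduler_type: linear, the learning rate decays from its initial value toward zero over the course of training. A minimal sketch of that decay follows, assuming no warmup steps (none are listed above) and taking the total step count, 49256, from the last row of the training results. `linear_lr` is an illustrative helper written for this card, not a Transformers function.

```python
BASE_LR = 5e-05      # learning_rate from the hyperparameter list
TOTAL_STEPS = 49256  # final step in the training results (8 epochs x 6157 steps)
WARMUP_STEPS = 0     # assumption: the card lists no warmup setting


def linear_lr(step: int,
              base_lr: float = BASE_LR,
              total_steps: int = TOTAL_STEPS,
              warmup_steps: int = WARMUP_STEPS) -> float:
    """Learning rate at a given optimizer step under linear warmup and linear decay."""
    if step < warmup_steps:
        # Linear ramp from 0 up to base_lr during warmup.
        return base_lr * step / max(1, warmup_steps)
    # Linear decay from base_lr down to 0 at total_steps.
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


# Learning rate at the end of each epoch (6157 steps per epoch).
for epoch in range(1, 9):
    step = epoch * 6157
    print(epoch, step, f"{linear_lr(step):.3e}")
```

Under these assumptions the rate is roughly half its initial value at the end of epoch 4 and reaches zero at step 49256.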

Training results

Training Loss	Epoch	Step	Validation Loss	Rouge1	Rouge2	RougeL	RougeLsum	Gen Len
1.2197	1.0	6157	1.1987	51.1834	39.9631	51.1841	51.1643	6.7607
0.9954	2.0	12314	1.1706	50.7977	39.6190	50.6689	50.6616	6.3795
0.9489	3.0	18471	1.1442	52.3555	42.2113	52.2724	52.2803	6.3484
0.8887	4.0	24628	1.1390	51.5946	41.2028	51.4341	51.4546	6.3538
0.8414	5.0	30785	1.1799	51.9184	41.1821	51.8954	51.8789	6.7852
0.7530	6.0	36942	1.1829	52.4824	41.3235	52.3505	52.3882	6.6134
0.7471	7.0	43099	1.1995	51.3876	40.6408	51.2487	51.2770	6.6271
0.7327	8.0	49256	1.2001	51.6537	40.8793	51.4822	51.5420	6.6366
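
The table can be read programmatically to confirm which checkpoint the headline results refer to. The sketch below copies the validation loss and Rouge1 columns from the table and picks the best epoch under each criterion. The variable names are illustrative.

```python
# (epoch, validation_loss, rouge1) copied from the training results table
results = [
    (1, 1.1987, 51.1834),
    (2, 1.1706, 50.7977),
    (3, 1.1442, 52.3555),
    (4, 1.1390, 51.5946),
    (5, 1.1799, 51.9184),
    (6, 1.1829, 52.4824),
    (7, 1.1995, 51.3876),
    (8, 1.2001, 51.6537),
]

# Lowest validation loss and highest Rouge1 can point to different epochs.
best_by_loss = min(results, key=lambda row: row[1])
best_by_rouge1 = max(results, key=lambda row: row[2])

print(best_by_loss[0])    # 4
print(best_by_rouge1[0])  # 6
```

Epoch 4 has the lowest validation loss and matches the reported evaluation results, while Rouge1 peaks at epoch 6. Training loss keeps falling after epoch 4 as validation loss rises, which is consistent with overfitting in the later epochs.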

Framework versions

Transformers 4.39.3
PyTorch 2.2.1+cu121
Datasets 2.18.0
Tokenizers 0.15.2