rizvi-rahil786's picture
End of training
4b3b457 verified
|
raw
history blame
2.26 kB
metadata
license: apache-2.0
base_model: t5-small
tags:
  - generated_from_trainer
metrics:
  - rouge
model-index:
  - name: t5-small-hardaDerailKP
    results: []

t5-small-hardaDerailKP

This model is a fine-tuned version of t5-small on an unknown dataset. It achieves the following results on the evaluation set:

  • Loss: 1.1390
  • Rouge1: 51.5946
  • Rouge2: 41.2028
  • Rougel: 51.4341
  • Rougelsum: 51.4546
  • Gen Len: 6.3538

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 5e-05
  • train_batch_size: 4
  • eval_batch_size: 4
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 8

Training results

Training Loss Epoch Step Validation Loss Rouge1 Rouge2 Rougel Rougelsum Gen Len
1.2197 1.0 6157 1.1987 51.1834 39.9631 51.1841 51.1643 6.7607
0.9954 2.0 12314 1.1706 50.7977 39.619 50.6689 50.6616 6.3795
0.9489 3.0 18471 1.1442 52.3555 42.2113 52.2724 52.2803 6.3484
0.8887 4.0 24628 1.1390 51.5946 41.2028 51.4341 51.4546 6.3538
0.8414 5.0 30785 1.1799 51.9184 41.1821 51.8954 51.8789 6.7852
0.753 6.0 36942 1.1829 52.4824 41.3235 52.3505 52.3882 6.6134
0.7471 7.0 43099 1.1995 51.3876 40.6408 51.2487 51.277 6.6271
0.7327 8.0 49256 1.2001 51.6537 40.8793 51.4822 51.542 6.6366

Framework versions

  • Transformers 4.39.3
  • Pytorch 2.2.1+cu121
  • Datasets 2.18.0
  • Tokenizers 0.15.2