metadata
license: apache-2.0
base_model: google-t5/t5-base
tags:
- generated_from_trainer
metrics:
- rouge
model-index:
- name: t5-base-finetuned-stocknews_1900_100
results: []
t5-base-finetuned-stocknews_1900_100
This model is a fine-tuned version of google-t5/t5-base on an unknown dataset. It achieves the following results on the evaluation set:
- Loss: 1.2997
- Rouge1: 16.6203
- Rouge2: 8.7831
- Rougel: 13.9116
- Rougelsum: 15.4831
- Gen Len: 19.0
Model description
More information needed
Intended uses & limitations
More information needed
Training and evaluation data
More information needed
Training procedure
Training hyperparameters
The following hyperparameters were used during training:
- learning_rate: 2e-05
- train_batch_size: 4
- eval_batch_size: 4
- seed: 42
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: linear
- num_epochs: 40
- mixed_precision_training: Native AMP
Training results
Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
---|---|---|---|---|---|---|---|---|
No log | 1.0 | 102 | 1.5488 | 14.6381 | 6.8963 | 12.1802 | 13.6527 | 19.0 |
No log | 2.0 | 204 | 1.4139 | 15.0451 | 6.9216 | 12.6068 | 14.1445 | 19.0 |
No log | 3.0 | 306 | 1.3627 | 15.3864 | 7.115 | 12.6537 | 14.267 | 19.0 |
No log | 4.0 | 408 | 1.3288 | 15.6891 | 7.5106 | 13.0451 | 14.6203 | 19.0 |
1.8685 | 5.0 | 510 | 1.3087 | 15.8071 | 7.6382 | 13.103 | 14.7587 | 19.0 |
1.8685 | 6.0 | 612 | 1.2938 | 15.6775 | 7.6448 | 13.0823 | 14.6034 | 19.0 |
1.8685 | 7.0 | 714 | 1.2870 | 15.7672 | 7.89 | 13.3325 | 14.7821 | 19.0 |
1.8685 | 8.0 | 816 | 1.2779 | 16.1616 | 8.1642 | 13.4471 | 15.0305 | 19.0 |
1.8685 | 9.0 | 918 | 1.2731 | 16.3679 | 8.4804 | 13.7618 | 15.3468 | 19.0 |
1.1991 | 10.0 | 1020 | 1.2695 | 16.2821 | 8.456 | 13.7692 | 15.2461 | 19.0 |
1.1991 | 11.0 | 1122 | 1.2647 | 16.4056 | 8.5019 | 13.7217 | 15.3711 | 19.0 |
1.1991 | 12.0 | 1224 | 1.2667 | 16.4259 | 8.6692 | 13.8396 | 15.4122 | 19.0 |
1.1991 | 13.0 | 1326 | 1.2654 | 16.6988 | 8.9574 | 14.0239 | 15.6864 | 19.0 |
1.1991 | 14.0 | 1428 | 1.2648 | 16.7394 | 9.0588 | 14.0529 | 15.6644 | 19.0 |
1.0382 | 15.0 | 1530 | 1.2642 | 16.6864 | 9.106 | 13.9046 | 15.5687 | 19.0 |
1.0382 | 16.0 | 1632 | 1.2662 | 16.6786 | 8.8288 | 13.9603 | 15.5724 | 19.0 |
1.0382 | 17.0 | 1734 | 1.2651 | 16.7446 | 8.9211 | 13.9999 | 15.6617 | 19.0 |
1.0382 | 18.0 | 1836 | 1.2702 | 16.6361 | 8.8503 | 14.0324 | 15.546 | 19.0 |
1.0382 | 19.0 | 1938 | 1.2676 | 16.7046 | 9.0089 | 14.073 | 15.6342 | 19.0 |
0.9273 | 20.0 | 2040 | 1.2732 | 16.4339 | 8.6714 | 13.8422 | 15.44 | 19.0 |
0.9273 | 21.0 | 2142 | 1.2743 | 16.5655 | 8.7747 | 13.839 | 15.4958 | 19.0 |
0.9273 | 22.0 | 2244 | 1.2781 | 16.7749 | 8.9154 | 14.1216 | 15.6395 | 19.0 |
0.9273 | 23.0 | 2346 | 1.2814 | 16.535 | 8.7436 | 13.971 | 15.5056 | 19.0 |
0.9273 | 24.0 | 2448 | 1.2795 | 16.6612 | 8.7045 | 14.0096 | 15.5692 | 19.0 |
0.8539 | 25.0 | 2550 | 1.2844 | 16.6083 | 8.6106 | 13.9202 | 15.5641 | 19.0 |
0.8539 | 26.0 | 2652 | 1.2817 | 16.6449 | 8.8127 | 14.0562 | 15.5792 | 19.0 |
0.8539 | 27.0 | 2754 | 1.2856 | 16.6185 | 8.7475 | 14.0134 | 15.5439 | 19.0 |
0.8539 | 28.0 | 2856 | 1.2868 | 16.4913 | 8.7293 | 13.9367 | 15.4702 | 19.0 |
0.8539 | 29.0 | 2958 | 1.2905 | 16.4887 | 8.6461 | 13.8893 | 15.4342 | 19.0 |
0.8006 | 30.0 | 3060 | 1.2893 | 16.5861 | 8.695 | 13.9081 | 15.4307 | 19.0 |
0.8006 | 31.0 | 3162 | 1.2919 | 16.5972 | 8.8314 | 13.9069 | 15.4967 | 19.0 |
0.8006 | 32.0 | 3264 | 1.2940 | 16.5957 | 8.789 | 13.9202 | 15.4839 | 19.0 |
0.8006 | 33.0 | 3366 | 1.2946 | 16.6313 | 8.8011 | 13.9684 | 15.5256 | 19.0 |
0.8006 | 34.0 | 3468 | 1.2945 | 16.6711 | 8.8915 | 14.0228 | 15.5394 | 19.0 |
0.7598 | 35.0 | 3570 | 1.2970 | 16.67 | 8.891 | 13.9749 | 15.5174 | 19.0 |
0.7598 | 36.0 | 3672 | 1.2975 | 16.6223 | 8.7522 | 13.9528 | 15.4761 | 19.0 |
0.7598 | 37.0 | 3774 | 1.2987 | 16.6444 | 8.8594 | 13.9567 | 15.5117 | 19.0 |
0.7598 | 38.0 | 3876 | 1.2993 | 16.6444 | 8.8594 | 13.9567 | 15.5117 | 19.0 |
0.7598 | 39.0 | 3978 | 1.2996 | 16.6196 | 8.8108 | 13.9213 | 15.4806 | 19.0 |
0.7463 | 40.0 | 4080 | 1.2997 | 16.6203 | 8.7831 | 13.9116 | 15.4831 | 19.0 |
Framework versions
- Transformers 4.38.2
- Pytorch 2.1.0+cu121
- Datasets 2.18.0
- Tokenizers 0.15.2