license: apache-2.0
base_model: pakawadeep/mt5-base-finetuned-ctfl
tags:
  - generated_from_keras_callback
model-index:
  - name: pakawadeep/mt5-base-finetuned-ctfl
    results: []

pakawadeep/mt5-base-finetuned-ctfl

This model is a fine-tuned version of pakawadeep/mt5-base-finetuned-ctfl; the training dataset is not specified. After the final training epoch (epoch 15), it reports the following results:

  • Train Loss: 0.5173
  • Validation Loss: 0.9877
  • Train Rouge1: 8.2037
  • Train Rouge2: 1.6832
  • Train RougeL: 8.0622
  • Train RougeLsum: 8.0269
  • Train Gen Len: 11.9505
  • Epoch: 15
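
For readers unfamiliar with the ROUGE figures listed above, the following is a minimal sketch of how a ROUGE-N F-measure can be computed. It assumes plain whitespace tokenization and a 0-100 scale, which appears to match the scale used in this card; the function name `rouge_n` is hypothetical, and the scores in this card were most likely produced by a full ROUGE implementation with its own tokenizer, so this sketch will not reproduce them exactly.

```python
from collections import Counter


def rouge_n(reference: str, candidate: str, n: int = 1) -> float:
    """Simplified ROUGE-N F-measure on a 0-100 scale.

    Assumption: lowercased whitespace tokens, no stemming. Real ROUGE
    implementations tokenize differently, so values may differ.
    """

    def ngrams(text: str) -> Counter:
        tokens = text.lower().split()
        return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))

    ref, cand = ngrams(reference), ngrams(candidate)
    # Clipped overlap: each n-gram counts at most as often as it appears in both.
    overlap = sum((ref & cand).values())
    if overlap == 0:
        return 0.0
    precision = overlap / sum(cand.values())
    recall = overlap / sum(ref.values())
    return 100 * 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    print(rouge_n("the cat sat on the mat", "the cat lay on the mat", n=1))
    print(rouge_n("the cat sat on the mat", "the cat lay on the mat", n=2))
```

ROUGE-1 counts shared unigrams and ROUGE-2 counts shared bigrams, which is why the Rouge2 values in this card are consistently lower than the Rouge1 values.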

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • optimizer: {'name': 'AdamWeightDecay', 'learning_rate': 2e-05, 'decay': 0.0, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-07, 'amsgrad': False, 'weight_decay_rate': 0.01}
  • training_precision: float32
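
The optimizer settings above can be turned back into an optimizer object. The sketch below parses the dictionary exactly as printed in this card and keeps only the arguments that `AdamWeightDecay` in Transformers names explicitly. The helper `build_optimizer` is hypothetical, requires TensorFlow and Transformers to be installed, and is not confirmed to match the original training script.

```python
import ast

# Optimizer entry copied verbatim from this model card.
CARD_OPTIMIZER = (
    "{'name': 'AdamWeightDecay', 'learning_rate': 2e-05, 'decay': 0.0, "
    "'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-07, 'amsgrad': False, "
    "'weight_decay_rate': 0.01}"
)

config = ast.literal_eval(CARD_OPTIMIZER)

# Keep only arguments that AdamWeightDecay declares by name; 'name' and
# 'decay' are dropped here as a conservative assumption.
ADAMW_KEYS = (
    "learning_rate", "beta_1", "beta_2", "epsilon", "amsgrad", "weight_decay_rate",
)
optimizer_kwargs = {key: config[key] for key in ADAMW_KEYS}


def build_optimizer():
    """Hypothetical helper: build the optimizer (needs TensorFlow installed)."""
    from transformers import AdamWeightDecay  # imported lazily on purpose

    return AdamWeightDecay(**optimizer_kwargs)


if __name__ == "__main__":
    print(optimizer_kwargs)
```

The lazy import keeps the configuration inspectable on machines without TensorFlow; calling `build_optimizer()` is only needed when actually training.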

Training results

| Train Loss | Validation Loss | Train Rouge1 | Train Rouge2 | Train RougeL | Train RougeLsum | Train Gen Len | Epoch |
|------------|-----------------|--------------|--------------|--------------|-----------------|---------------|-------|
| 1.1067 | 1.0353 | 7.4965 | 1.6832 | 7.4257 | 7.3904 | 11.8762 | 0 |
| 0.9573 | 1.0010 | 7.9915 | 1.6832 | 7.9208 | 7.7793 | 11.9109 | 1 |
| 0.8858 | 1.0002 | 8.4866 | 2.1782 | 8.2744 | 8.2744 | 11.9158 | 2 |
| 0.8402 | 0.9827 | 8.4866 | 2.1782 | 8.2744 | 8.2744 | 11.9554 | 3 |
| 0.7900 | 0.9961 | 8.4866 | 2.1782 | 8.2744 | 8.2744 | 11.9158 | 4 |
| 0.7646 | 0.9898 | 8.4866 | 2.1782 | 8.2744 | 8.2744 | 11.9505 | 5 |
| 0.7190 | 0.9805 | 8.4866 | 2.1782 | 8.2744 | 8.2744 | 11.9208 | 6 |
| 0.7021 | 0.9683 | 8.4866 | 2.1782 | 8.2744 | 8.2744 | 11.9455 | 7 |
| 0.6613 | 0.9732 | 8.9816 | 2.1782 | 8.7694 | 8.8755 | 11.9703 | 8 |
| 0.6416 | 0.9807 | 8.4866 | 2.1782 | 8.2744 | 8.2744 | 11.9505 | 9 |
| 0.6139 | 0.9771 | 8.4866 | 2.1782 | 8.2744 | 8.2744 | 11.9307 | 10 |
| 0.5864 | 0.9723 | 8.4866 | 2.1782 | 8.2744 | 8.2744 | 11.9505 | 11 |
| 0.5844 | 0.9919 | 8.4866 | 2.1782 | 8.2744 | 8.2744 | 11.9653 | 12 |
| 0.5679 | 1.0097 | 8.4866 | 2.1782 | 8.2744 | 8.2744 | 11.9307 | 13 |
| 0.5329 | 0.9947 | 7.9915 | 1.1881 | 7.8501 | 7.7793 | 11.9554 | 14 |
| 0.5173 | 0.9877 | 8.2037 | 1.6832 | 8.0622 | 8.0269 | 11.9505 | 15 |
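
To read the per-epoch results more easily, the sketch below finds the epoch with the lowest validation loss and the train/validation gap at the final epoch. The loss values are copied from the results above; the variable names are illustrative only and are not part of the original training code.

```python
# (train_loss, validation_loss) per epoch, copied from the training results.
LOSSES = [
    (1.1067, 1.0353), (0.9573, 1.0010), (0.8858, 1.0002), (0.8402, 0.9827),
    (0.7900, 0.9961), (0.7646, 0.9898), (0.7190, 0.9805), (0.7021, 0.9683),
    (0.6613, 0.9732), (0.6416, 0.9807), (0.6139, 0.9771), (0.5864, 0.9723),
    (0.5844, 0.9919), (0.5679, 1.0097), (0.5329, 0.9947), (0.5173, 0.9877),
]

# Epoch index whose validation loss is smallest.
best_epoch = min(range(len(LOSSES)), key=lambda epoch: LOSSES[epoch][1])
best_val_loss = LOSSES[best_epoch][1]

# Gap between validation and training loss at the last recorded epoch.
final_train_loss, final_val_loss = LOSSES[-1]
final_gap = final_val_loss - final_train_loss

if __name__ == "__main__":
    print(f"best epoch: {best_epoch}, validation loss: {best_val_loss}")
    print(f"final train/validation gap: {final_gap:.4f}")
```

On these numbers, the lowest validation loss (0.9683) occurs at epoch 7, while training loss keeps falling through epoch 15. A widening gap of this kind is commonly read as a sign of overfitting, so an earlier checkpoint may generalize at least as well as the final one.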

Framework versions

  • Transformers 4.38.2
  • TensorFlow 2.15.0
  • Datasets 2.18.0
  • Tokenizers 0.15.2