minseo0820's picture
End of training
3d14468 verified
metadata
license: apache-2.0
base_model: t5-small
tags:
  - generated_from_trainer
datasets:
  - fairytale_qa
metrics:
  - rouge
  - f1
model-index:
  - name: t5-small-finetuned-FairytaleQA-AnswerExtraction
    results:
      - task:
          name: Sequence-to-sequence Language Modeling
          type: text2text-generation
        dataset:
          name: fairytale_qa
          type: fairytale_qa
          config: default
          split: validation
          args: default
        metrics:
          - name: Rouge1
            type: rouge
            value: 10.7124
          - name: F1
            type: f1
            value: 0.2626

t5-small-finetuned-FairytaleQA-AnswerExtraction

This model is a fine-tuned version of t5-small on the fairytale_qa dataset. It achieves the following results on the evaluation set:

  • Loss: 0.0695
  • Rouge1: 10.7124
  • Rouge2: 3.2292
  • Rougel: 10.375
  • Rougelsum: 10.3824
  • F1: 0.2626
  • Exact Match: 0.4878
  • Gen Len: 11.9668

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 2e-05
  • train_batch_size: 4
  • eval_batch_size: 4
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 5
  • mixed_precision_training: Native AMP

Training results

Training Loss Epoch Step Validation Loss Rouge1 Rouge2 Rougel Rougelsum F1 Exact Match Gen Len
0.0777 1.0 2137 0.0727 11.076 3.0967 10.6128 10.6396 0.1301 0.3902 12.522
0.0727 2.0 4274 0.0707 11.288 3.2828 10.9125 10.9225 0.152 0.4878 12.161
0.0696 3.0 6411 0.0699 10.7512 3.3182 10.406 10.4123 0.2626 0.4878 12.1122
0.0719 4.0 8548 0.0696 10.803 3.2133 10.4337 10.4223 0.2626 0.4878 11.9698
0.07 5.0 10685 0.0695 10.7124 3.2292 10.375 10.3824 0.2626 0.4878 11.9668

Framework versions

  • Transformers 4.42.4
  • Pytorch 2.3.1+cu121
  • Datasets 2.20.0
  • Tokenizers 0.19.1