
flan-t5-small-samp

This model is a fine-tuned version of google/flan-t5-small; the training dataset is not specified. After the final epoch (epoch 5, step 110), it achieves the following results on the evaluation set:

  • Loss: 1.2590
  • Rouge1: 43.9209
  • Rouge2: 34.9601
  • Rougel: 42.6215
  • Rougelsum: 42.2386
  • Gen Len: 19.0
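
Since this is a fine-tuned google/flan-t5-small checkpoint, it can presumably be loaded like any other T5-style sequence-to-sequence model. The following is a minimal sketch, not the author's documented usage: the repository id `your-namespace/flan-t5-small-samp` is a hypothetical placeholder (the card does not give the namespace), the `generate` helper is illustrative, and `max_new_tokens=32` is an arbitrary choice. The card also does not state the task or prompt format, so the input text is left to the caller.

```python
# Hypothetical repo id: the card gives the model name but not its namespace.
MODEL_ID = "your-namespace/flan-t5-small-samp"


def generate(text: str, max_new_tokens: int = 32) -> str:
    """Run one input string through the fine-tuned seq2seq model and decode the output."""
    # Imported lazily so this module still loads where transformers is not installed.
    from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForSeq2SeqLM.from_pretrained(MODEL_ID)

    # Tokenize to PyTorch tensors; truncate inputs that exceed the model's limit.
    inputs = tokenizer(text, return_tensors="pt", truncation=True)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)

    # generate() returns a batch; decode the first (only) sequence.
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `generate("...")` requires network access (or a local cache) and a real repository id in place of the placeholder.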

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 5e-05
  • train_batch_size: 8
  • eval_batch_size: 8
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 5
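
To make the linear schedule concrete, the sketch below computes the learning rate at each epoch boundary from the values listed in this card (peak rate 5e-05, 110 total steps, 22 steps per epoch). It assumes zero warmup steps, since the card lists no warmup setting, and the function names are illustrative. It also derives the range of training-set sizes consistent with 22 steps per epoch at batch size 8, assuming a single device, no gradient accumulation, and a retained final partial batch.

```python
# Values taken from this card.
PEAK_LR = 5e-05
TOTAL_STEPS = 110
STEPS_PER_EPOCH = 22
TRAIN_BATCH_SIZE = 8
WARMUP_STEPS = 0  # assumption: the card does not list a warmup setting


def linear_lr(step: int) -> float:
    """Learning rate at an optimizer step: linear warmup, then linear decay to zero."""
    if step < WARMUP_STEPS:
        return PEAK_LR * step / max(1, WARMUP_STEPS)
    remaining = max(0, TOTAL_STEPS - step)
    return PEAK_LR * remaining / max(1, TOTAL_STEPS - WARMUP_STEPS)


def train_set_size_bounds() -> tuple[int, int]:
    """Smallest and largest example counts that give 22 batches of size 8.

    Assumes one device, no gradient accumulation, and that the last
    (possibly partial) batch of each epoch is kept.
    """
    lower = (STEPS_PER_EPOCH - 1) * TRAIN_BATCH_SIZE + 1
    upper = STEPS_PER_EPOCH * TRAIN_BATCH_SIZE
    return lower, upper


if __name__ == "__main__":
    for epoch in range(6):
        step = epoch * STEPS_PER_EPOCH
        print(f"epoch {epoch}: step {step}, lr {linear_lr(step):.2e}")
    print("training examples (under the stated assumptions):", train_set_size_bounds())
```

Under these assumptions the training set holds between 169 and 176 examples, which is small enough that the reported metrics should be read with caution.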

Training results

Training Loss  Epoch  Step  Validation Loss  Rouge1   Rouge2   Rougel   Rougelsum  Gen Len
No log         1.0    22    2.1815           36.7312  15.9425  31.417   34.2234    19.0
No log         2.0    44    1.6264           43.7638  34.817   42.7197  42.2012    19.0
No log         3.0    66    1.3892           43.7638  34.817   42.7197  42.2012    19.0
No log         4.0    88    1.2825           43.7624  34.875   42.5386  42.1659    19.0
No log         5.0    110   1.2590           43.9209  34.9601  42.6215  42.2386    19.0
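
The per-epoch numbers can be compared programmatically. The sketch below copies the rows from the table above and reports, for each metric, the epoch with the best value, plus the epoch-to-epoch change in validation loss. The helper names `best_epoch` and `deltas` are illustrative and not part of any library.

```python
# Columns: epoch, step, validation loss, Rouge1, Rouge2, Rougel, Rougelsum
# (values copied from the training results table in this card).
RESULTS = [
    (1, 22, 2.1815, 36.7312, 15.9425, 31.417, 34.2234),
    (2, 44, 1.6264, 43.7638, 34.817, 42.7197, 42.2012),
    (3, 66, 1.3892, 43.7638, 34.817, 42.7197, 42.2012),
    (4, 88, 1.2825, 43.7624, 34.875, 42.5386, 42.1659),
    (5, 110, 1.2590, 43.9209, 34.9601, 42.6215, 42.2386),
]
COLUMNS = {"loss": 2, "rouge1": 3, "rouge2": 4, "rougeL": 5, "rougeLsum": 6}


def best_epoch(rows, column, higher_is_better=True):
    """Return the epoch with the best value in a column (earliest epoch wins ties)."""
    pick = max if higher_is_better else min
    return pick(rows, key=lambda row: row[column])[0]


def deltas(rows, column):
    """Change in a column between consecutive epochs, rounded to 4 decimals."""
    return [round(b[column] - a[column], 4) for a, b in zip(rows, rows[1:])]


if __name__ == "__main__":
    for name, col in COLUMNS.items():
        epoch = best_epoch(RESULTS, col, higher_is_better=(name != "loss"))
        print(f"best {name}: epoch {epoch}")
    print("validation loss change per epoch:", deltas(RESULTS, COLUMNS["loss"]))
```

By these numbers, validation loss falls in every epoch, while the ROUGE scores change very little after epoch 2 and Rougel is highest at epochs 2 and 3.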

Framework versions

  • Transformers 4.36.2
  • Pytorch 2.1.0+cu121
  • Datasets 2.16.0
  • Tokenizers 0.15.0

Model size

  • Parameters: 77M
  • Tensor type: F32 (Safetensors)