Edit model card

t5_recommendation_sports_equipment_english

This model is a fine-tuned version of t5-large on the None dataset. It achieves the following results on the evaluation set:

  • Loss: 0.3870
  • Rouge1: 62.6984
  • Rouge2: 57.1429
  • Rougel: 62.6984
  • Rougelsum: 62.6984
  • Gen Len: 4.0476

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 0.0001
  • train_batch_size: 4
  • eval_batch_size: 4
  • seed: 42
  • gradient_accumulation_steps: 4
  • total_train_batch_size: 16
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 10

Training results

Training Loss Epoch Step Validation Loss Rouge1 Rouge2 Rougel Rougelsum Gen Len
No log 0.96 6 7.0208 13.3733 1.8519 13.7691 13.6567 18.7143
No log 1.92 12 1.8113 20.4762 14.2857 20.4762 20.9524 3.6667
No log 2.88 18 0.6189 26.9841 9.5238 26.9841 26.9841 3.7143
No log 4.0 25 0.4762 46.4286 33.3333 46.8254 46.0317 3.9524
No log 4.96 31 0.5373 57.7778 47.6190 57.9365 57.4603 4.0
No log 5.92 37 0.4113 62.6984 57.1429 63.4921 62.6984 3.8571
No log 6.88 43 0.4039 62.6984 57.1429 62.6984 62.6984 4.0952
No log 8.0 50 0.4728 62.6984 57.1429 62.6984 62.6984 4.0476
No log 8.96 56 0.4161 62.6984 57.1429 62.6984 62.6984 4.0476
No log 9.6 60 0.3870 62.6984 57.1429 62.6984 62.6984 4.0476

Framework versions

  • Transformers 4.38.2
  • Pytorch 2.2.1+cu121
  • Datasets 2.18.0
  • Tokenizers 0.15.2
Downloads last month
1
Safetensors
Model size
738M params
Tensor type
F32
·

Finetuned from