Edit model card

20231130_Clinic-T5-Sci_30ep_Summ

This model was trained from scratch on an unknown dataset. It achieves the following results on the evaluation set:

  • Loss: 0.8446

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 0.0001
  • train_batch_size: 16
  • eval_batch_size: 8
  • seed: 42
  • gradient_accumulation_steps: 2
  • total_train_batch_size: 32
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • lr_scheduler_warmup_steps: 500
  • num_epochs: 30

Training results

Training Loss Epoch Step Validation Loss
No log 1.0 38 13.2074
No log 2.0 76 12.3126
No log 3.0 114 10.8187
No log 4.0 152 8.3675
No log 5.0 190 3.8696
No log 6.0 228 2.2954
No log 7.0 266 1.4198
No log 8.0 304 1.2200
No log 9.0 342 1.1076
No log 10.0 380 0.9902
No log 11.0 418 0.9378
No log 12.0 456 0.9063
No log 13.0 494 0.8837
5.3959 14.0 532 0.8709
5.3959 15.0 570 0.8629
5.3959 16.0 608 0.8592
5.3959 17.0 646 0.8533
5.3959 18.0 684 0.8515
5.3959 19.0 722 0.8474
5.3959 20.0 760 0.8446

Framework versions

  • Transformers 4.34.1
  • Pytorch 2.1.1+cu121
  • Datasets 2.15.0
  • Tokenizers 0.14.1
Downloads last month
0