Edit model card

bart-cnn-pubmed-arxiv-pubmed-v3-e12

This model is a fine-tuned version of theojolliffe/bart-cnn-pubmed-arxiv-pubmed on an unknown dataset. It achieves the following results on the evaluation set:

  • Loss: 0.8658
  • Rouge1: 57.2678
  • Rouge2: 43.347
  • Rougel: 47.0854
  • Rougelsum: 55.4167
  • Gen Len: 142.0

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 2e-05
  • train_batch_size: 1
  • eval_batch_size: 1
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 12
  • mixed_precision_training: Native AMP

Training results

Training Loss Epoch Step Validation Loss Rouge1 Rouge2 Rougel Rougelsum Gen Len
1.2548 1.0 795 0.9154 53.4249 34.0377 36.4396 50.9884 141.8889
0.6994 2.0 1590 0.8213 54.7613 35.9428 38.3899 51.9527 142.0
0.5272 3.0 2385 0.7703 53.8561 35.4871 38.0502 51.131 141.8889
0.3407 4.0 3180 0.7764 53.9514 35.8553 39.1935 51.7005 142.0
0.2612 5.0 3975 0.7529 54.4056 36.2605 40.8003 52.0424 142.0
0.1702 6.0 4770 0.8105 54.2251 37.1441 41.2472 52.2803 142.0
0.1276 7.0 5565 0.8004 56.49 40.4009 44.018 54.2404 141.5556
0.0978 8.0 6360 0.7890 56.6339 40.9867 43.9603 54.4468 142.0
0.0711 9.0 7155 0.8285 56.0469 40.7758 44.1395 53.9668 142.0
0.0649 10.0 7950 0.8498 56.9873 42.4721 46.705 55.2188 142.0
0.0471 11.0 8745 0.8547 57.7898 43.4238 46.5868 56.0858 142.0
0.0336 12.0 9540 0.8658 57.2678 43.347 47.0854 55.4167 142.0

Framework versions

  • Transformers 4.18.0
  • Pytorch 1.11.0+cu113
  • Datasets 2.1.0
  • Tokenizers 0.12.1
Downloads last month
8