bart-cnn-pubmed-arxiv-pubmed-arxiv-arxiv-v3-e8

This model is a fine-tuned version of theojolliffe/bart-cnn-pubmed-arxiv-pubmed-arxiv-arxiv; the fine-tuning dataset is not specified. After the final (8th) training epoch, it achieves the following results on the evaluation set:

  • Loss: 0.8063
  • Rouge1: 54.9922
  • Rouge2: 38.7265
  • Rougel: 41.9288
  • Rougelsum: 52.8766
  • Gen Len: 142.0

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 2e-05
  • train_batch_size: 2
  • eval_batch_size: 2
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 8
  • mixed_precision_training: Native AMP
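
For readers who want to reproduce a similar setup, the hyperparameters above could be expressed as Trainer arguments roughly as follows. This is a minimal sketch, not the author's actual training script: the mapping of train_batch_size and eval_batch_size to per-device values, the use of predict_with_generate, and the output directory name are assumptions.

```python
# Hyperparameters from the list above, keyed by the argument names that
# transformers' Seq2SeqTrainingArguments accepts.
TRAINING_KWARGS = {
    "learning_rate": 2e-5,
    "per_device_train_batch_size": 2,  # card says train_batch_size: 2 (assumed per device)
    "per_device_eval_batch_size": 2,   # card says eval_batch_size: 2 (assumed per device)
    "seed": 42,
    "adam_beta1": 0.9,
    "adam_beta2": 0.999,
    "adam_epsilon": 1e-8,
    "lr_scheduler_type": "linear",
    "num_train_epochs": 8,
    "fp16": True,  # "Native AMP" mixed precision; needs a CUDA-capable GPU
}


def build_training_args(output_dir="bart-finetune-out"):
    """Build Seq2SeqTrainingArguments from the values above.

    output_dir is a hypothetical path. predict_with_generate=True is an
    assumption: it makes evaluation call generate(), which ROUGE scoring
    of summaries requires.
    """
    # Imported here so the dictionary can be inspected without transformers installed.
    from transformers import Seq2SeqTrainingArguments

    return Seq2SeqTrainingArguments(
        output_dir=output_dir,
        predict_with_generate=True,
        **TRAINING_KWARGS,
    )
```

The card does not state the evaluation schedule, maximum generation length, or any gradient accumulation, so those are left at library defaults here.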

Training results

Training Loss  Epoch  Step  Validation Loss  Rouge1   Rouge2   Rougel   Rougelsum  Gen Len
No log         1.0    398   0.8651           53.3185  33.3722  35.8852  50.5929    142.0
0.8268         2.0    796   0.8063           53.5267  34.3205  36.9783  51.0289    142.0
0.5331         3.0    1194  0.8155           53.5409  34.9962  38.078   51.2038    142.0
0.3588         4.0    1592  0.7883           53.7055  35.0869  38.1521  51.3094    141.4815
0.3588         5.0    1990  0.7770           54.4542  37.5817  39.8734  52.1947    141.7778
0.2447         6.0    2388  0.7929           55.1571  38.8425  41.4301  53.3049    141.4444
0.1765         7.0    2786  0.7909           55.5838  38.6226  42.0453  53.543     142.0
0.13           8.0    3184  0.8063           54.9922  38.7265  41.9288  52.8766    142.0
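
The table also allows a few quick sanity checks. The sketch below copies the per-epoch values from the table and derives the steps per epoch, a rough training-set size, and the epochs with the best validation loss and Rouge1. The size estimate assumes a single device and no gradient accumulation, neither of which the card states.

```python
# Per-epoch rows copied from the table above:
# (epoch, step, validation_loss, rouge1, rouge2, rougel, rougelsum)
RESULTS = [
    (1, 398, 0.8651, 53.3185, 33.3722, 35.8852, 50.5929),
    (2, 796, 0.8063, 53.5267, 34.3205, 36.9783, 51.0289),
    (3, 1194, 0.8155, 53.5409, 34.9962, 38.078, 51.2038),
    (4, 1592, 0.7883, 53.7055, 35.0869, 38.1521, 51.3094),
    (5, 1990, 0.7770, 54.4542, 37.5817, 39.8734, 52.1947),
    (6, 2388, 0.7929, 55.1571, 38.8425, 41.4301, 53.3049),
    (7, 2786, 0.7909, 55.5838, 38.6226, 42.0453, 53.543),
    (8, 3184, 0.8063, 54.9922, 38.7265, 41.9288, 52.8766),
]
TRAIN_BATCH_SIZE = 2  # from the hyperparameter list

# Every epoch ends at a multiple of the same step count.
steps_per_epoch = RESULTS[0][1]
assert all(step == steps_per_epoch * epoch for epoch, step, *_ in RESULTS)

# Assumption: one device, no gradient accumulation, last partial batch kept.
# Then ceil(n / batch_size) == steps_per_epoch bounds the training-set size n.
max_examples = steps_per_epoch * TRAIN_BATCH_SIZE
min_examples = max_examples - TRAIN_BATCH_SIZE + 1

best_by_loss = min(RESULTS, key=lambda row: row[2])
best_by_rouge1 = max(RESULTS, key=lambda row: row[3])

print(f"steps per epoch: {steps_per_epoch}")
print(f"estimated training examples: {min_examples} to {max_examples}")
print(f"lowest validation loss: {best_by_loss[2]} at epoch {best_by_loss[0]}")
print(f"highest Rouge1: {best_by_rouge1[3]} at epoch {best_by_rouge1[0]}")
```

Under these assumptions, the reported final checkpoint (epoch 8) is neither the lowest-loss epoch nor the highest-Rouge1 epoch in the table, which may matter when choosing a checkpoint for downstream use.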

Framework versions

  • Transformers 4.19.2
  • Pytorch 1.11.0+cu113
  • Datasets 2.2.2
  • Tokenizers 0.12.1