license: mit
tags:
  - generated_from_trainer
metrics:
  - rouge
model-index:
  - name: bart-cnn-pubmed-arxiv-pubmed-arxiv-earlystopping
    results: []

bart-cnn-pubmed-arxiv-pubmed-arxiv-earlystopping

This model is a fine-tuned version of theojolliffe/bart-cnn-pubmed-arxiv-pubmed-arxiv. The fine-tuning dataset is not recorded in this card. On the evaluation set, the last logged checkpoint (step 1875, epoch 4.71) achieves the following results:

  • Loss: 0.8007
  • Rouge1: 52.4033
  • Rouge2: 34.5747
  • Rougel: 37.1754
  • Rougelsum: 50.116
  • Gen Len: 141.6481

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 2e-05
  • train_batch_size: 2
  • eval_batch_size: 2
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 20
  • mixed_precision_training: Native AMP
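
To make the `lr_scheduler_type: linear` setting concrete, here is a minimal sketch of a linear decay schedule using the values listed above. Two things are assumptions rather than facts from this card: that no warmup was used (none is listed), and that one epoch is about 398 optimizer steps (estimated from the training log, where step 1875 corresponds to epoch 4.71). The function name `linear_lr` is hypothetical and this is not the Transformers implementation.

```python
BASE_LR = 2e-05   # learning_rate from the card
NUM_EPOCHS = 20   # num_epochs from the card

# Assumption: steps per epoch estimated from the log (step 1875 at epoch 4.71).
STEPS_PER_EPOCH = round(1875 / 4.71)
TOTAL_STEPS = NUM_EPOCHS * STEPS_PER_EPOCH

# Assumption: the card lists no warmup, so none is applied here.
WARMUP_STEPS = 0


def linear_lr(step, base_lr=BASE_LR, total_steps=TOTAL_STEPS,
              warmup_steps=WARMUP_STEPS):
    """Learning rate at `step`: linear warmup, then linear decay to zero."""
    if step < warmup_steps:
        return base_lr * step / max(1, warmup_steps)
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


# Learning rate at the start, at the last logged step, and at the planned end.
for step in (0, 1875, TOTAL_STEPS):
    print(step, linear_lr(step))
```

Under these assumptions, training ended at step 1875 of a planned 7960, so the learning rate was still at roughly three quarters of its initial value when the run stopped.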

Training results

| Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
|---|---|---|---|---|---|---|---|---|
| No log | 0.31 | 125 | 1.2189 | 52.135 | 32.1909 | 33.2568 | 49.212 | 142.0 |
| No log | 0.63 | 250 | 1.0848 | 51.8506 | 31.8383 | 34.0225 | 48.9427 | 141.8889 |
| No log | 0.94 | 375 | 0.9838 | 52.0218 | 31.5571 | 32.7594 | 49.1574 | 142.0 |
| 1.1603 | 1.26 | 500 | 0.9675 | 51.5844 | 31.8092 | 33.2469 | 48.5004 | 142.0 |
| 1.1603 | 1.57 | 625 | 0.9470 | 52.3383 | 32.5252 | 34.4442 | 49.5981 | 142.0 |
| 1.1603 | 1.88 | 750 | 0.8849 | 53.1715 | 34.2133 | 35.3615 | 50.857 | 141.8148 |
| 1.1603 | 2.2 | 875 | 0.8490 | 53.5919 | 34.3111 | 36.5608 | 50.8721 | 141.6296 |
| 0.688 | 2.51 | 1000 | 0.8434 | 52.5115 | 33.3104 | 35.8243 | 50.0625 | 142.0 |
| 0.688 | 2.83 | 1125 | 0.8089 | 53.3029 | 33.258 | 35.3429 | 50.2641 | 141.963 |
| 0.688 | 3.14 | 1250 | 0.8768 | 53.2829 | 33.6257 | 36.3661 | 50.5444 | 142.0 |
| 0.688 | 3.45 | 1375 | 0.8256 | 53.5736 | 34.7489 | 36.4858 | 51.1342 | 141.8889 |
| 0.4551 | 3.77 | 1500 | 0.7884 | 54.0105 | 35.051 | 37.4089 | 51.2838 | 141.8889 |
| 0.4551 | 4.08 | 1625 | 0.8145 | 52.6526 | 34.173 | 37.4877 | 50.3849 | 141.0 |
| 0.4551 | 4.4 | 1750 | 0.8358 | 54.8493 | 36.3011 | 38.7691 | 51.951 | 142.0 |
| 0.4551 | 4.71 | 1875 | 0.8007 | 52.4033 | 34.5747 | 37.1754 | 50.116 | 141.6481 |
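
The headline metrics in this card come from the last logged evaluation, which is not the best row in the log. The sketch below shows one way to check that. The step, validation loss and Rouge1 values are copied from the training results above; the variable names are hypothetical.

```python
# (step, validation loss, Rouge1) copied from the training results log.
LOGGED_EVALS = [
    (125, 1.2189, 52.135),
    (250, 1.0848, 51.8506),
    (375, 0.9838, 52.0218),
    (500, 0.9675, 51.5844),
    (625, 0.9470, 52.3383),
    (750, 0.8849, 53.1715),
    (875, 0.8490, 53.5919),
    (1000, 0.8434, 52.5115),
    (1125, 0.8089, 53.3029),
    (1250, 0.8768, 53.2829),
    (1375, 0.8256, 53.5736),
    (1500, 0.7884, 54.0105),
    (1625, 0.8145, 52.6526),
    (1750, 0.8358, 54.8493),
    (1875, 0.8007, 52.4033),
]

# Row with the lowest validation loss.
best_loss = min(LOGGED_EVALS, key=lambda row: row[1])
# Row with the highest Rouge1 score.
best_rouge1 = max(LOGGED_EVALS, key=lambda row: row[2])
# Last logged row, which matches the headline results of this card.
final = LOGGED_EVALS[-1]

print("lowest validation loss:", best_loss)
print("highest Rouge1:", best_rouge1)
print("final logged evaluation:", final)
```

The lowest validation loss is at step 1500 (0.7884) and the highest Rouge1 at step 1750 (54.8493), while the reported results are from step 1875. The card does not say which checkpoint's weights were published.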

Framework versions

  • Transformers 4.18.0
  • Pytorch 1.11.0+cu113
  • Datasets 2.2.1
  • Tokenizers 0.12.1