metadata

license: mit
tags:
  - generated_from_trainer
metrics:
  - rouge
model-index:
  - name: bart-cnn-pubmed-arxiv-pubmed-arxiv-earlystopping
    results: []

bart-cnn-pubmed-arxiv-pubmed-arxiv-earlystopping

This model is a fine-tuned version of theojolliffe/bart-cnn-pubmed-arxiv-pubmed-arxiv on an unknown dataset. It achieves the following results on the evaluation set:

Loss: 0.8007
Rouge1: 52.4033
Rouge2: 34.5747
Rougel: 37.1754
Rougelsum: 50.116
Gen Len: 141.6481

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

learning_rate: 2e-05
train_batch_size: 2
eval_batch_size: 2
seed: 42
optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
lr_scheduler_type: linear
num_epochs: 20
mixed_precision_training: Native AMP

Training results

Training Loss	Epoch	Step	Validation Loss	Rouge1	Rouge2	Rougel	Rougelsum	Gen Len
No log	0.31	125	1.2189	52.135	32.1909	33.2568	49.212	142.0
No log	0.63	250	1.0848	51.8506	31.8383	34.0225	48.9427	141.8889
No log	0.94	375	0.9838	52.0218	31.5571	32.7594	49.1574	142.0
1.1603	1.26	500	0.9675	51.5844	31.8092	33.2469	48.5004	142.0
1.1603	1.57	625	0.9470	52.3383	32.5252	34.4442	49.5981	142.0
1.1603	1.88	750	0.8849	53.1715	34.2133	35.3615	50.857	141.8148
1.1603	2.2	875	0.8490	53.5919	34.3111	36.5608	50.8721	141.6296
0.688	2.51	1000	0.8434	52.5115	33.3104	35.8243	50.0625	142.0
0.688	2.83	1125	0.8089	53.3029	33.258	35.3429	50.2641	141.963
0.688	3.14	1250	0.8768	53.2829	33.6257	36.3661	50.5444	142.0
0.688	3.45	1375	0.8256	53.5736	34.7489	36.4858	51.1342	141.8889
0.4551	3.77	1500	0.7884	54.0105	35.051	37.4089	51.2838	141.8889
0.4551	4.08	1625	0.8145	52.6526	34.173	37.4877	50.3849	141.0
0.4551	4.4	1750	0.8358	54.8493	36.3011	38.7691	51.951	142.0
0.4551	4.71	1875	0.8007	52.4033	34.5747	37.1754	50.116	141.6481

Framework versions

Transformers 4.18.0
Pytorch 1.11.0+cu113
Datasets 2.2.1
Tokenizers 0.12.1