metadata

license: mit
tags:
  - generated_from_trainer
metrics:
  - rouge
model-index:
  - name: bart-cnn-pubmed-arxiv-pubmed-arxiv-arxiv-v3-e8
    results: []

bart-cnn-pubmed-arxiv-pubmed-arxiv-arxiv-v3-e8

This model is a fine-tuned version of theojolliffe/bart-cnn-pubmed-arxiv-pubmed-arxiv-arxiv on an unknown dataset. It achieves the following results on the evaluation set:

Loss: 0.8063
Rouge1: 54.9922
Rouge2: 38.7265
Rougel: 41.9288
Rougelsum: 52.8766
Gen Len: 142.0

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

learning_rate: 2e-05
train_batch_size: 2
eval_batch_size: 2
seed: 42
optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
lr_scheduler_type: linear
num_epochs: 8
mixed_precision_training: Native AMP

Training results

Training Loss	Epoch	Step	Validation Loss	Rouge1	Rouge2	Rougel	Rougelsum	Gen Len
No log	1.0	398	0.8651	53.3185	33.3722	35.8852	50.5929	142.0
0.8268	2.0	796	0.8063	53.5267	34.3205	36.9783	51.0289	142.0
0.5331	3.0	1194	0.8155	53.5409	34.9962	38.078	51.2038	142.0
0.3588	4.0	1592	0.7883	53.7055	35.0869	38.1521	51.3094	141.4815
0.3588	5.0	1990	0.7770	54.4542	37.5817	39.8734	52.1947	141.7778
0.2447	6.0	2388	0.7929	55.1571	38.8425	41.4301	53.3049	141.4444
0.1765	7.0	2786	0.7909	55.5838	38.6226	42.0453	53.543	142.0
0.13	8.0	3184	0.8063	54.9922	38.7265	41.9288	52.8766	142.0

Framework versions

Transformers 4.19.2
Pytorch 1.11.0+cu113
Datasets 2.2.2
Tokenizers 0.12.1