theojolliffe/bart-cnn-pubmed-arxiv-pubmed-arxiv-arxiv

Text2Text Generation Transformers PyTorch TensorBoard bart generated_from_trainer Inference Endpoints

bart-cnn-pubmed-arxiv-pubmed-arxiv-arxiv

This model is a fine-tuned version of theojolliffe/bart-cnn-pubmed-arxiv-pubmed-arxiv; the fine-tuning dataset is not specified. After the final (fifth) epoch it achieves the following results on the evaluation set:

Loss: 0.8065
Rouge1: 54.5916
Rouge2: 36.7817
Rougel: 40.4708
Rougelsum: 52.5754
Gen Len: 142.0
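
For readers unfamiliar with the metrics above: ROUGE-N measures n-gram overlap between a generated summary and a reference summary, and the values reported here are presumably F-measures scaled to 0-100. The following is a simplified sketch in pure Python, assuming lowercased whitespace tokenization and no stemming. It is not the scorer that produced the numbers above (the card does not say which implementation was used), and the function names are illustrative.

```python
from collections import Counter


def ngrams(tokens, n):
    """Return a multiset (Counter) of the n-grams in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def rouge_n_f1(reference, candidate, n=1):
    """Simplified ROUGE-N F1 on a 0-100 scale.

    Assumption: lowercased whitespace tokenization, no stemming.
    """
    ref = ngrams(reference.lower().split(), n)
    cand = ngrams(candidate.lower().split(), n)
    overlap = sum((ref & cand).values())  # clipped n-gram matches
    if overlap == 0:
        return 0.0
    precision = overlap / sum(cand.values())
    recall = overlap / sum(ref.values())
    return 100 * 2 * precision * recall / (precision + recall)
```

Calling `rouge_n_f1(reference, candidate, n=2)` gives the bigram variant. Scores from a standard ROUGE package will differ somewhat because of tokenization and stemming choices.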

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

learning_rate: 2e-05
train_batch_size: 1
eval_batch_size: 1
seed: 42
optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
lr_scheduler_type: linear
num_epochs: 5
mixed_precision_training: Native AMP
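
As a rough sketch, the hyperparameters listed above would map onto keyword arguments of `transformers.Seq2SeqTrainingArguments` roughly as follows. The card does not include the training script, so this mapping is an assumption. It also assumes a single device (so that `train_batch_size` equals the per-device batch size), and `output_dir`, `evaluation_strategy`, and `predict_with_generate` are illustrative values that do not come from the card. The configuration is kept as a plain dictionary so it can be inspected without a GPU.

```python
# Hypothetical reconstruction of the training configuration.
# Keys are Seq2SeqTrainingArguments parameter names (Transformers 4.18);
# values come from the hyperparameter list unless marked as an assumption.
training_kwargs = {
    "output_dir": "bart-cnn-pubmed-arxiv-pubmed-arxiv-arxiv",  # assumption
    "learning_rate": 2e-05,
    "per_device_train_batch_size": 1,  # assumes a single device
    "per_device_eval_batch_size": 1,
    "seed": 42,
    "adam_beta1": 0.9,
    "adam_beta2": 0.999,
    "adam_epsilon": 1e-08,
    "lr_scheduler_type": "linear",
    "num_train_epochs": 5,
    "fp16": True,  # "Native AMP" mixed precision; needs a CUDA device
    "evaluation_strategy": "epoch",  # assumption: one results row per epoch
    "predict_with_generate": True,  # assumption: ROUGE needs generated text
}
```

Passing these as `Seq2SeqTrainingArguments(**training_kwargs)` should reproduce the listed settings, provided a CUDA device is available for `fp16`.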

Training results

Training Loss	Epoch	Step	Validation Loss	Rouge1	Rouge2	Rougel	Rougelsum	Gen Len
1.2945	1.0	795	0.9555	51.91	32.0926	33.6727	49.5306	142.0
0.7153	2.0	1590	0.8317	52.4708	34.1035	35.2968	50.2966	141.963
0.5398	3.0	2385	0.8133	52.4603	33.497	36.4227	50.2513	141.8704
0.3568	4.0	3180	0.8091	52.3993	34.2424	37.7819	50.2069	142.0
0.2842	5.0	3975	0.8065	54.5916	36.7817	40.4708	52.5754	142.0
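
The step counts in the table make the linear schedule easy to work out. A minimal sketch, assuming zero warmup steps (the card lists none) and no gradient accumulation: the learning rate decays linearly from 2e-05 at step 0 to 0 at step 3975. Under the same assumptions, 795 steps per epoch at batch size 1 would imply roughly 795 training examples. The function name `linear_lr` is illustrative.

```python
BASE_LR = 2e-05      # learning_rate from the hyperparameter list
TOTAL_STEPS = 3975   # final step in the results table (5 epochs x 795 steps)
WARMUP_STEPS = 0     # assumption: not stated on the card


def linear_lr(step, base_lr=BASE_LR, total=TOTAL_STEPS, warmup=WARMUP_STEPS):
    """Learning rate after `step` optimizer updates under linear warmup/decay."""
    if step < warmup:
        return base_lr * step / max(1, warmup)
    return base_lr * max(0.0, (total - step) / max(1, total - warmup))


# Learning rate at the end of each epoch listed in the table.
for epoch, step in enumerate(range(795, TOTAL_STEPS + 1, 795), start=1):
    print(f"epoch {epoch}: step {step}, lr = {linear_lr(step):.2e}")
```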

Framework versions

Transformers 4.18.0
Pytorch 1.11.0+cu113
Datasets 2.2.0
Tokenizers 0.12.1
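
With the framework versions above installed, the checkpoint can presumably be loaded through the `transformers` summarization pipeline. This is a minimal sketch, assuming the checkpoint is available on the Hugging Face Hub under the id shown and behaves like other BART summarization models. The `max_length=142` default mirrors the reported Gen Len and is an assumption, as are `min_length=30` and the helper name `summarize`.

```python
MODEL_ID = "theojolliffe/bart-cnn-pubmed-arxiv-pubmed-arxiv-arxiv"


def summarize(text, max_length=142, min_length=30):
    """Summarize `text` with the fine-tuned BART checkpoint.

    The import is local so that defining this helper does not require
    transformers; calling it downloads the model weights on first use.
    max_length/min_length are assumed defaults, not values from the card.
    """
    from transformers import pipeline

    summarizer = pipeline("summarization", model=MODEL_ID)
    result = summarizer(
        text,
        max_length=max_length,
        min_length=min_length,
        truncation=True,  # clip inputs longer than the model's context window
    )
    return result[0]["summary_text"]
```

Calling `summarize(article_text)` returns a single summary string. For repeated calls it would be cheaper to build the pipeline once and reuse it.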
