mdaffarudiyanto
/

t5-small-finetuned-xsum-updated

Text2Text Generation

Generated from Trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

t5-small-finetuned-xsum-updated

This model is a fine-tuned version of t5-small on the xsum dataset. It achieves the following results on the evaluation set:

Loss: 2.0767
Rouge1: 33.2945
Rouge2: 12.0165
Rougel: 26.9804
Rougelsum: 26.9729
Gen Len: 18.7853

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

learning_rate: 0.0001
train_batch_size: 16
eval_batch_size: 16
seed: 42
optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
lr_scheduler_type: linear
num_epochs: 15
mixed_precision_training: Native AMP

Training results

Training Loss	Epoch	Step	Validation Loss	Rouge1	Rouge2	Rougel	Rougelsum	Gen Len
2.5219	1.0	12753	2.3054	30.4745	9.435	24.263	24.2522	18.823
2.4191	2.0	25506	2.2385	31.2305	10.0552	24.9345	24.9254	18.7562
2.3564	3.0	38259	2.1961	31.8234	10.6556	25.6109	25.6023	18.7708
2.3028	4.0	51012	2.1692	32.2053	11.0513	26.0184	26.0056	18.772
2.2737	5.0	63765	2.1452	32.3716	11.1779	26.1423	26.1363	18.7731
2.2432	6.0	76518	2.1304	32.5413	11.2517	26.2119	26.2098	18.8007
2.2266	7.0	89271	2.1193	32.8983	11.5683	26.5995	26.5958	18.8108
2.1863	8.0	102024	2.1058	32.9046	11.6564	26.6466	26.6473	18.8008
2.1583	9.0	114777	2.0987	32.9622	11.7285	26.7161	26.7116	18.7798
2.1653	10.0	127530	2.0900	33.1259	11.8525	26.8461	26.8419	18.7999
2.1403	11.0	140283	2.0880	33.0949	11.8135	26.7863	26.7765	18.7629
2.1212	12.0	153036	2.0825	33.1671	11.8939	26.9072	26.8982	18.7825
2.1021	13.0	165789	2.0793	33.1375	11.9119	26.8466	26.8386	18.8076
2.0877	14.0	178542	2.0774	33.2516	11.9574	26.9391	26.9327	18.7989
2.0984	15.0	191295	2.0767	33.2945	12.0165	26.9804	26.9729	18.7853

Framework versions

Transformers 4.35.2
Pytorch 2.1.0+cu121
Datasets 2.15.0
Tokenizers 0.15.0

Downloads last month: 20

Safetensors

Model size

60.5M params

Tensor type

F32

·

Inference Examples

Text2Text Generation

This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Model tree for mdaffarudiyanto/t5-small-finetuned-xsum-updated

Base model

google-t5/t5-small

Finetuned

(1628)

this model

Dataset used to train mdaffarudiyanto/t5-small-finetuned-xsum-updated

Evaluation results

Rouge1 on xsum
validation set self-reported

33.294

View on Papers With Code