flan-t5-base-AlexWang99

This model is a fine-tuned version of google/flan-t5-base on an unknown dataset. It achieves the following results on the evaluation set (a minimal usage sketch follows the list):

  • Loss: 2.1123
  • Rouge1: 38.0335
  • Rouge2: 13.3555
  • RougeL: 28.0303
  • RougeLsum: 34.6374
  • Gen Len: 99.0657
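
Since the target task is not documented, the following is only a minimal loading sketch. The repository id is a hypothetical placeholder, and the summarization-style prompt is an assumption inferred from the ROUGE metrics and generation length above, not something this card confirms.

```python
# Minimal inference sketch. The Hub id below is a hypothetical placeholder,
# and the summarization prompt is an assumption (the training dataset is unknown).
from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

model_id = "AlexWang99/flan-t5-base-AlexWang99"  # hypothetical repository id

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForSeq2SeqLM.from_pretrained(model_id)

inputs = tokenizer("summarize: <your input text>", return_tensors="pt")
# Gen Len above is roughly 99 tokens, so allow reasonably long outputs.
output_ids = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```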

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training (a rough Seq2SeqTrainingArguments sketch follows the list):

  • learning_rate: 5e-05
  • train_batch_size: 12
  • eval_batch_size: 12
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 3
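
The actual training script is not published, so the following is only a sketch of roughly equivalent settings; evaluation_strategy and predict_with_generate are assumptions inferred from the per-epoch ROUGE results below.

```python
# Sketch of roughly equivalent Trainer settings; treat this as an
# approximation, not the author's actual training code.
from transformers import Seq2SeqTrainingArguments

training_args = Seq2SeqTrainingArguments(
    output_dir="flan-t5-base-AlexWang99",
    learning_rate=5e-05,
    per_device_train_batch_size=12,
    per_device_eval_batch_size=12,
    seed=42,
    adam_beta1=0.9,
    adam_beta2=0.999,
    adam_epsilon=1e-08,
    lr_scheduler_type="linear",
    num_train_epochs=3,
    evaluation_strategy="epoch",  # assumption: metrics below are reported once per epoch
    predict_with_generate=True,   # assumption: needed to compute ROUGE during evaluation
)
```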

Training results

Training Loss | Epoch | Step | Validation Loss | Rouge1  | Rouge2  | RougeL  | RougeLsum | Gen Len
2.2981        | 1.0   | 1087 | 2.1403          | 37.9801 | 13.2974 | 27.8704 | 34.4487   | 100.7330
2.2233        | 2.0   | 2174 | 2.1171          | 37.5415 | 12.9975 | 27.9482 | 34.1073   | 95.4997
2.1872        | 3.0   | 3261 | 2.1123          | 38.0335 | 13.3555 | 28.0303 | 34.6374   | 99.0657
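
The ROUGE columns above follow the convention of the `evaluate` library (F1 scores scaled to 0-100). A minimal sketch of that metric computation, assuming this pipeline was used (not confirmed by the card):

```python
# Minimal ROUGE computation sketch using the `evaluate` library; the
# predictions and references are placeholders for illustration only.
import evaluate

rouge = evaluate.load("rouge")
scores = rouge.compute(
    predictions=["a generated summary"],
    references=["a reference summary"],
    use_stemmer=True,
)
# Keys correspond to the columns above: rouge1, rouge2, rougeL, rougeLsum.
print({k: round(v * 100, 4) for k, v in scores.items()})
```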

Framework versions

  • Transformers 4.38.2
  • Pytorch 2.2.1+cu121
  • Datasets 2.18.0
  • Tokenizers 0.15.2