metadata

license: apache-2.0
base_model: google/flan-t5-small
tags:
  - generated_from_trainer
metrics:
  - rouge
model-index:
  - name: flan-t5-small-chat
    results: []

flan-t5-small-chat

This model is a fine-tuned version of google/flan-t5-small on the None dataset. It achieves the following results on the evaluation set:

Loss: 0.2367
Rouge1: 49.588
Rouge2: 44.597
Rougel: 46.6928
Rougelsum: 46.662
Gen Len: 19.0

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

learning_rate: 5e-05
train_batch_size: 8
eval_batch_size: 8
seed: 42
optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
lr_scheduler_type: linear
num_epochs: 4

Training results

Training Loss	Epoch	Step	Validation Loss	Rouge1	Rouge2	Rougel	Rougelsum	Gen Len
No log	1.0	26	1.0858	4.9303	0.674	4.7306	4.7335	8.1731
No log	2.0	52	0.5339	39.1598	30.9426	39.0844	39.1597	16.3462
No log	3.0	78	0.2892	47.1834	41.288	45.1755	45.1471	19.0
No log	4.0	104	0.2367	49.588	44.597	46.6928	46.662	19.0

Framework versions

Transformers 4.34.1
Pytorch 2.1.0
Datasets 2.14.6
Tokenizers 0.14.1