salbatarni/flan-t5-small-asap_t5_f1_prompt_adherence

flan-t5-small-asap_t5_f1_prompt_adherence

This model is a fine-tuned version of google/flan-t5-small on an unspecified dataset (no dataset name was recorded by the Trainer). It achieves the following results on the evaluation set:

Loss: 0.0647
Rouge1: 79.2006
Rouge2: 73.7804
Rougel: 79.2274
Rougelsum: 79.2405
Gen Len: 12.0471
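
Because the card does not document the expected input format, the following is only a minimal loading sketch, assuming the checkpoint works with the standard Transformers seq2seq classes like its base model google/flan-t5-small. The helper names `load` and `generate`, the `max_new_tokens=32` value, and the placeholder input are illustrative assumptions, not something the card specifies.

```python
# Repository ID as shown on the model page.
MODEL_ID = "salbatarni/flan-t5-small-asap_t5_f1_prompt_adherence"


def load(model_id=MODEL_ID):
    """Download (or read from cache) the tokenizer and seq2seq model."""
    # Imported lazily so the module can be inspected without transformers installed.
    from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForSeq2SeqLM.from_pretrained(model_id)
    return tokenizer, model


def generate(text, tokenizer, model, max_new_tokens=32):
    """Run one text-to-text generation pass and return the decoded string."""
    inputs = tokenizer(text, return_tensors="pt", truncation=True)
    # max_new_tokens=32 is an assumption; the card only reports an
    # average generated length of about 12 tokens on the evaluation set.
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)


# Usage (downloads the weights on first call):
#   tokenizer, model = load()
#   print(generate("<your input text here>", tokenizer, model))
```

The prompt template used during fine-tuning is not stated anywhere in the card, so outputs for arbitrary input text may not be meaningful.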

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

learning_rate: 5e-05
train_batch_size: 8
eval_batch_size: 8
seed: 42
optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
lr_scheduler_type: linear
num_epochs: 5
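
As a rough sketch, the hyperparameters above map onto Transformers `Seq2SeqTrainingArguments` keyword arguments as shown below. The mapping of `train_batch_size` to `per_device_train_batch_size`, the use of `predict_with_generate=True`, and a warmup of zero steps are assumptions: the card lists none of them explicitly. `linear_lr` and `build_training_args` are hypothetical helper names; `linear_lr` reproduces the shape of a linear decay schedule without warmup, using the 1355 total steps from the results table.

```python
# Values copied from the hyperparameter list; key names follow TrainingArguments.
TRAINING_KWARGS = {
    "learning_rate": 5e-5,
    "per_device_train_batch_size": 8,
    "per_device_eval_batch_size": 8,
    "seed": 42,
    "adam_beta1": 0.9,
    "adam_beta2": 0.999,
    "adam_epsilon": 1e-8,
    "lr_scheduler_type": "linear",
    "num_train_epochs": 5,
}


def linear_lr(step, total_steps, base_lr=5e-5, warmup_steps=0):
    """Learning rate at `step` under linear warmup followed by linear decay to 0.

    warmup_steps=0 is an assumption (the card does not list a warmup setting).
    """
    if step < warmup_steps:
        return base_lr * step / max(1, warmup_steps)
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


def build_training_args(output_dir):
    """Hypothetical helper: build the arguments object (requires transformers)."""
    from transformers import Seq2SeqTrainingArguments

    # predict_with_generate=True is assumed because ROUGE and Gen Len are reported.
    return Seq2SeqTrainingArguments(
        output_dir=output_dir, predict_with_generate=True, **TRAINING_KWARGS
    )


if __name__ == "__main__":
    # Learning rate at the end of each epoch (271 steps per epoch, 1355 total).
    for epoch in range(6):
        print(epoch, linear_lr(epoch * 271, 1355))
```

Under these assumptions the learning rate falls by one fifth of its starting value per epoch and reaches zero at step 1355.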

Training results

Training Loss	Epoch	Step	Validation Loss	Rouge1	Rouge2	Rougel	Rougelsum	Gen Len
No log	1.0	271	0.0993	76.0054	69.0068	76.0105	75.9722	12.0014
0.4749	2.0	542	0.0693	78.2033	72.3784	78.2208	78.1724	12.0540
0.4749	3.0	813	0.0660	79.4313	73.9365	79.4865	79.4338	12.0429
0.0883	4.0	1084	0.0644	79.2898	73.9021	79.3393	79.3354	12.0568
0.0883	5.0	1355	0.0647	79.2006	73.7804	79.2274	79.2405	12.0471
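
The table can be read programmatically to see which epoch scored best on each metric and what the step counts imply. The sketch below copies the rows verbatim (training loss omitted). The estimate of the training-set size assumes a single device, no gradient accumulation, and a final partial batch that is kept; none of these are stated in the card.

```python
from collections import namedtuple

Row = namedtuple("Row", "epoch step val_loss rouge1 rouge2 rougeL rougeLsum gen_len")

# Rows copied from the training results table.
ROWS = [
    Row(1, 271, 0.0993, 76.0054, 69.0068, 76.0105, 75.9722, 12.0014),
    Row(2, 542, 0.0693, 78.2033, 72.3784, 78.2208, 78.1724, 12.0540),
    Row(3, 813, 0.0660, 79.4313, 73.9365, 79.4865, 79.4338, 12.0429),
    Row(4, 1084, 0.0644, 79.2898, 73.9021, 79.3393, 79.3354, 12.0568),
    Row(5, 1355, 0.0647, 79.2006, 73.7804, 79.2274, 79.2405, 12.0471),
]

# Best epoch by lowest validation loss and by highest ROUGE-1.
best_by_loss = min(ROWS, key=lambda r: r.val_loss)
best_by_rouge1 = max(ROWS, key=lambda r: r.rouge1)

# Optimizer steps per epoch, taken from the first row.
steps_per_epoch = ROWS[0].step // ROWS[0].epoch

# With batch size 8, ceil(N / 8) == steps_per_epoch bounds the number of
# training examples N (assumption: one device, no gradient accumulation).
TRAIN_BATCH_SIZE = 8
min_examples = (steps_per_epoch - 1) * TRAIN_BATCH_SIZE + 1
max_examples = steps_per_epoch * TRAIN_BATCH_SIZE

if __name__ == "__main__":
    print("lowest validation loss: epoch", best_by_loss.epoch)
    print("highest ROUGE-1: epoch", best_by_rouge1.epoch)
    print("steps per epoch:", steps_per_epoch)
    print("implied training examples:", min_examples, "to", max_examples)
```

The headline metrics at the top of the card match the epoch 5 row, that is, the final checkpoint. Epoch 4 has a slightly lower validation loss and epoch 3 slightly higher ROUGE scores, though the differences are small.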

Framework versions

Transformers 4.38.2
Pytorch 2.1.2
Datasets 2.18.0
Tokenizers 0.15.2
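
To compare a local environment against the versions listed above, a small check along these lines may help. It is a sketch only: the PyPI package names (in particular `torch` for Pytorch) and the helper names `installed_versions` and `mismatches` are assumptions, and local build suffixes such as `+cu121` are ignored when comparing.

```python
from importlib import metadata

# Versions from the "Framework versions" list, keyed by assumed PyPI package name.
EXPECTED = {
    "transformers": "4.38.2",
    "torch": "2.1.2",
    "datasets": "2.18.0",
    "tokenizers": "0.15.2",
}


def installed_versions(packages):
    """Return {package: installed version or None if not installed}."""
    found = {}
    for name in packages:
        try:
            found[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            found[name] = None
    return found


def mismatches(expected, found):
    """Return {package: (expected, installed)} for every package that differs."""
    report = {}
    for name, want in expected.items():
        have = found.get(name)
        # Drop a local build suffix like "+cu121" before comparing.
        base = have.split("+")[0] if have else None
        if base != want:
            report[name] = (want, have)
    return report


if __name__ == "__main__":
    for name, (want, have) in mismatches(EXPECTED, installed_versions(EXPECTED)).items():
        print(f"{name}: card lists {want}, installed {have}")
```

A mismatch does not necessarily mean the model will fail to load; the list only records the environment used for training.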

Model size

77M params, stored as Safetensors with F32 tensors
