2023_12_22_04_32_30

This model is a fine-tuned version of google/flan-t5-large on the background_summ dataset. It achieves the following results on the evaluation set:

Model description

More information needed

More information needed

More information needed

The following hyperparameters were used during training:

Training Loss	Epoch	Step	Validation Loss	Rouge1	Rouge2	Rougel	Rougelsum	Bertscore Precision	Bertscore Recall	Bertscore F1
No log	1.0	185	1.9484	34.1	14.4	22.5	31.0	84.5	85.7	85.1
No log	2.0	370	2.0967	30.5	12.0	20.6	27.3	83.3	85.5	84.4
1.631	3.0	555	2.2225	30.8	12.0	20.7	27.2	83.5	85.8	84.6
1.631	4.0	740	2.3722	32.7	12.4	21.5	29.1	84.1	86.1	85.1
1.631	5.0	925	2.4278	34.7	13.4	22.6	31.0	84.8	86.5	85.6
1.0815	6.0	1110	2.5025	35.4	13.5	22.7	31.8	85.1	86.6	85.8
1.0815	7.0	1295	2.6083	35.9	13.9	23.0	32.4	85.2	86.8	86.0
1.0815	8.0	1480	2.6081	36.0	13.8	22.9	32.4	85.3	86.8	86.0
0.8953	9.0	1665	2.6313	36.0	13.7	22.8	32.4	85.3	86.8	86.0
0.8953	10.0	1850	2.6624	36.2	13.9	22.9	32.7	85.4	86.8	86.1