
t5-wmt16-ro-en

This model is a fine-tuned version of t5-small on the wmt16 ro-en dataset. It achieves the following results on the evaluation set (a usage sketch follows the metric list):

  • Loss: 1.3574
  • Bleu: 27.1318
  • Gen Len: 42.5798
  • Loss Smallest Subnet: 1.3574
  • Bleu Smallest Subnet: 27.1318
  • Gen Len Smallest Subnet: 42.5798
  • Loss Random Subnet: 1.3574
  • Bleu Random Subnet: 27.1318
  • Gen Len Random Subnet: 42.5798
  • Loss Sum: 4.0723
  • Bleu Sum: 81.3954
  • Gen Len Sum: 127.7394
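
The card does not include a usage snippet, so here is a minimal inference sketch. It assumes the checkpoint is loaded from wonjeongho/t5-wmt16-ro-en (the Hub id this card belongs to) and follows the usual T5 task-prefix convention; the prefix and translation direction are assumptions, not documented in this card:

```python
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

# Assumed Hub id; swap in a local path if you trained the model yourself.
model_id = "wonjeongho/t5-wmt16-ro-en"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForSeq2SeqLM.from_pretrained(model_id)

# T5 is text-to-text and is normally steered with a task prefix. The prefix
# below assumes a Romanian-to-English direction, matching the "ro-en" name.
text = "translate Romanian to English: Acesta este un test."
inputs = tokenizer(text, return_tensors="pt")
outputs = model.generate(**inputs, max_length=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```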

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training (a sketch of the equivalent Seq2SeqTrainingArguments follows the list):

  • learning_rate: 5e-05
  • train_batch_size: 12
  • eval_batch_size: 24
  • seed: 42
  • distributed_type: multi-GPU
  • num_devices: 4
  • total_train_batch_size: 48
  • total_eval_batch_size: 96
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 5.0
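
For reference, these settings correspond roughly to the transformers Seq2SeqTrainingArguments below. This is a hedged sketch rather than the author's actual script; output_dir and predict_with_generate are assumptions, and the per-device batch sizes yield the listed totals of 48 and 96 once multiplied across the 4 GPUs.

```python
from transformers import Seq2SeqTrainingArguments

args = Seq2SeqTrainingArguments(
    output_dir="t5-wmt16-ro-en",       # assumed output directory name
    learning_rate=5e-5,
    per_device_train_batch_size=12,    # 12 x 4 GPUs = 48 total
    per_device_eval_batch_size=24,     # 24 x 4 GPUs = 96 total
    seed=42,
    num_train_epochs=5.0,
    lr_scheduler_type="linear",
    adam_beta1=0.9,                    # Adam with betas=(0.9, 0.999)
    adam_beta2=0.999,
    adam_epsilon=1e-8,
    predict_with_generate=True,        # assumed; needed for Bleu / Gen Len
)
```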

Training results

| Training Loss | Epoch | Step | Validation Loss | Bleu | Gen Len | Loss Smallest Subnet | Bleu Smallest Subnet | Gen Len Smallest Subnet | Loss Random Subnet | Loss Sum | Bleu Random Subnet | Bleu Sum | Gen Len Random Subnet | Gen Len Sum |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0.5967 | 1.0 | 12715 | 1.3820 | 26.593 | 42.4422 | 1.3820 | 26.593 | 42.4422 | 1.3820 | 4.1461 | 26.593 | 79.779 | 42.4422 | 127.3266 |
| 0.5768 | 2.0 | 25430 | 1.3728 | 26.6191 | 42.6738 | 1.3728 | 26.6191 | 42.6738 | 1.3728 | 4.1184 | 26.6191 | 79.8573 | 42.6738 | 128.0214 |
| 0.5663 | 3.0 | 38145 | 1.3616 | 26.9203 | 42.5298 | 1.3616 | 26.9203 | 42.5298 | 1.3616 | 4.0849 | 26.9203 | 80.7609 | 42.5298 | 127.5894 |
| 0.5523 | 4.0 | 50860 | 1.3570 | 27.0195 | 42.5203 | 1.3570 | 27.0195 | 42.5203 | 1.3570 | 4.0709 | 27.0195 | 81.0585 | 42.5203 | 127.5609 |
| 0.5436 | 5.0 | 63575 | 1.3574 | 27.1318 | 42.5798 | 1.3574 | 27.1318 | 42.5798 | 1.3574 | 4.0723 | 27.1318 | 81.3954 | 42.5798 | 127.7394 |
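
The metric code itself is not part of this card. The sketch below shows the standard sacreBLEU recipe for a Seq2SeqTrainer-style evaluation, which plausibly produced the Bleu and Gen Len columns; it uses load_metric from the Datasets version listed under Framework versions, and the Gen Len definition (mean non-pad token count) is an assumption.

```python
import numpy as np
from datasets import load_metric
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("wonjeongho/t5-wmt16-ro-en")
metric = load_metric("sacrebleu")

def compute_metrics(eval_preds):
    preds, labels = eval_preds
    # Labels are padded with -100 for the loss; restore pad ids before decoding.
    labels = np.where(labels != -100, labels, tokenizer.pad_token_id)
    decoded_preds = tokenizer.batch_decode(preds, skip_special_tokens=True)
    decoded_labels = tokenizer.batch_decode(labels, skip_special_tokens=True)
    result = metric.compute(predictions=decoded_preds,
                            references=[[label] for label in decoded_labels])
    # Gen Len as the mean number of non-pad tokens in the generations.
    gen_len = np.mean([np.count_nonzero(p != tokenizer.pad_token_id)
                       for p in preds])
    return {"bleu": result["score"], "gen_len": gen_len}
```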

Framework versions

  • Transformers 4.20.1
  • Pytorch 1.8.0
  • Datasets 2.4.0
  • Tokenizers 0.12.1