donut_experiment_bayesian_trial_12

This model is a fine-tuned version of naver-clova-ix/donut-base on the None dataset. It achieves the following results on the evaluation set:

Loss: 0.5083
Bleu: 0.0675
Precisions: [0.8421052631578947, 0.7822966507177034, 0.7423822714681441, 0.7006578947368421]
Brevity Penalty: 0.0883
Length Ratio: 0.2918
Translation Length: 475
Reference Length: 1628
Cer: 0.7537
Wer: 0.8211

Model description

More information needed

More information needed

More information needed

The following hyperparameters were used during training:

Training Loss	Epoch	Step	Validation Loss	Bleu	Precisions	Brevity Penalty	Length Ratio	Translation Length	Reference Length	Cer	Wer
0.0251	1.0	253	0.4936	0.0660	[0.8375527426160337, 0.7673860911270983, 0.7277777777777777, 0.6897689768976898]	0.0876	0.2912	474	1628	0.7600	0.8274
0.0144	2.0	506	0.4987	0.0683	[0.8445378151260504, 0.7852028639618138, 0.7458563535911602, 0.7049180327868853]	0.0889	0.2924	476	1628	0.7515	0.8189
0.0089	3.0	759	0.5083	0.0675	[0.8421052631578947, 0.7822966507177034, 0.7423822714681441, 0.7006578947368421]	0.0883	0.2918	475	1628	0.7537	0.8211