# indobart-en-id
This model is a fine-tuned version of indobenchmark/indobart-v2. It achieves the following results on the evaluation set:
- Loss: 0.4035
- Bleu: 20.4747
- Gen Len: 19.134
## Model description
This model was trained on the Asian Language Treebank (ALT) Project corpus (en-id) to translate English into Indonesian.
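A minimal usage sketch with the `transformers` pipeline API. This assumes the repository ships a tokenizer that `pipeline` can load automatically; note that the IndoNLG family normally uses a custom `IndoNLGTokenizer` from the `indobenchmark-toolkit` package, so plain auto-loading may need that package installed.

```python
from transformers import pipeline

# Load the fine-tuned model from the Hub (assumes a compatible
# tokenizer is bundled with the repo).
translator = pipeline(
    "text2text-generation",
    model="Mikask/indobart-altp-en-id",
)

result = translator("I am reading a book.", max_length=20)
print(result[0]["generated_text"])
```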
## Training and evaluation data
The ALT Project corpus contains over 20,000 English-Indonesian sentence pairs, split 80/20 into training and evaluation sets.
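The 80/20 split can be sketched as follows with a seeded shuffle. The sentence pairs below are placeholders standing in for the actual ALT en-id corpus, and seed 42 is taken from the hyperparameters listed later in this card; the exact split procedure used is not stated.

```python
import random

# Placeholder pairs standing in for the ~20,000 ALT en-id sentence pairs.
pairs = [(f"en sentence {i}", f"id sentence {i}") for i in range(20000)]

rng = random.Random(42)  # seeded for a reproducible split
rng.shuffle(pairs)

# 80% for training, 20% for evaluation.
split = int(0.8 * len(pairs))
train, eval_set = pairs[:split], pairs[split:]
print(len(train), len(eval_set))  # 16000 4000
```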
## Training procedure
### Training hyperparameters
The following hyperparameters were used during training:
- learning_rate: 2e-05
- train_batch_size: 32
- eval_batch_size: 16
- seed: 42
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: linear
- num_epochs: 10
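The hyperparameters above map onto a `Seq2SeqTrainingArguments` configuration roughly like the sketch below. This is a hypothetical reconstruction, not the original training script; `output_dir` is a placeholder, and Adam with betas (0.9, 0.999) and epsilon 1e-08 is the Trainer's default optimizer, so it needs no explicit flag.

```python
from transformers import Seq2SeqTrainingArguments

# Hypothetical reconstruction of the training configuration from the
# hyperparameters listed in this card.
args = Seq2SeqTrainingArguments(
    output_dir="indobart-en-id",      # placeholder path
    learning_rate=2e-5,
    per_device_train_batch_size=32,
    per_device_eval_batch_size=16,
    seed=42,
    num_train_epochs=10,
    lr_scheduler_type="linear",
    predict_with_generate=True,       # needed to compute BLEU/Gen Len at eval time
)
```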
### Training results
| Training Loss | Epoch | Step | Validation Loss | Bleu    | Gen Len |
|---------------|-------|------|-----------------|---------|---------|
| 0.914         | 1.0   | 503  | 0.5118          | 14.8205 | 19.2874 |
| 0.5805        | 2.0   | 1006 | 0.4606          | 17.2483 | 19.236  |
| 0.4971        | 3.0   | 1509 | 0.4370          | 18.4965 | 19.2449 |
| 0.4444        | 4.0   | 2012 | 0.4251          | 19.3336 | 19.1765 |
| 0.4037        | 5.0   | 2515 | 0.4152          | 19.6643 | 19.1648 |
| 0.3732        | 6.0   | 3018 | 0.4103          | 20.0701 | 19.1987 |
| 0.3503        | 7.0   | 3521 | 0.4066          | 20.1264 | 19.1258 |
| 0.3325        | 8.0   | 4024 | 0.4040          | 20.5349 | 19.1571 |
| 0.3207        | 9.0   | 4527 | 0.4041          | 20.4371 | 19.1348 |
| 0.3116        | 10.0  | 5030 | 0.4035          | 20.4747 | 19.134  |
## Framework versions
- Transformers 4.33.3
- Pytorch 2.0.1+cu118
- Datasets 2.14.5
- Tokenizers 0.13.3
## Model tree for Mikask/indobart-altp-en-id

Base model: indobenchmark/indobart-v2