# English to Luo Author: Kathleen Siminyu ## Data - The JW300 English-Luo dataset, 154941 lines. ## Model - Link to google drive folder with model(https://drive.google.com/drive/folders/1_q4Gj4cIAbhNxTU3XOa1OH9Sxtg2cvEM?usp=sharing) ## Analysis - Tried out different BPE settings and managed some improvements on the baseline. Highest BLEU score I recorded was at BPE 30000. It might be worth exploring different bpe settings for the source and target languages. Results from different settings included in the analysis below. BPE 4000 ```sh BLEU dev: 23.22 BLEU test: 32.64 ``` BPE 5000 ```sh 2020-05-17 17:50:01,865 - dev bleu: 23.32 [Beam search decoding with beam size = 5 and alpha = 1.0] 2020-05-17 17:51:18,137 - test bleu: 32.52 [Beam search decoding with beam size = 5 and alpha = 1.0] ``` BPE 10000 ```sh 2020-05-17 17:52:12,396 - dev bleu: 24.26 [Beam search decoding with beam size = 5 and alpha = 1.0] 2020-05-17 17:53:26,893 - test bleu: 32.97 [Beam search decoding with beam size = 5 and alpha = 1.0] ``` BPE 15000 ```sh 2020-05-17 17:54:28,069 - dev bleu: 24.55 [Beam search decoding with beam size = 5 and alpha = 1.0] 2020-05-17 17:55:46,813 - test bleu: 33.43 [Beam search decoding with beam size = 5 and alpha = 1.0] ``` BPE 20000 ```sh 2020-05-17 18:01:41,447 - dev bleu: 25.34 [Beam search decoding with beam size = 5 and alpha = 1.0] 2020-05-17 18:03:03,470 - test bleu: 33.38 [Beam search decoding with beam size = 5 and alpha = 1.0] ``` BPE 25000 ```sh 2020-05-17 18:09:26,069 - dev bleu: 24.81 [Beam search decoding with beam size = 5 and alpha = 1.0] 2020-05-17 18:10:45,675 - test bleu: 33.68 [Beam search decoding with beam size = 5 and alpha = 1.0] ``` BPE 35000 ```sh 2020-05-17 20:35:45,530 - dev bleu: 25.08 [Beam search decoding with beam size = 5 and alpha = 1.0] 2020-05-17 20:36:23,282 - test bleu: 33.70 [Beam search decoding with beam size = 5 and alpha = 1.0] ``` BPE 40000 ```sh 2020-05-17 20:43:32,542 - dev bleu: 24.75 [Beam search decoding with beam size = 5 and alpha = 1.0] 2020-05-17 20:44:11,245 - test bleu: 33.58 [Beam search decoding with beam size = 5 and alpha = 1.0] ``` # Results BPE 30000 ``` 2020-05-17 20:28:46,395 - dev bleu: 25.24 [Beam search decoding with beam size = 5 and alpha = 1.0] 2020-05-17 20:29:20,937 - test bleu: 34.33 [Beam search decoding with beam size = 5 and alpha = 1.0] ```