# English to Kikuyu Author: Kathleen Siminyu ## Data - The JW300 English-Kikuyu dataset, 106699 lines. ## Model - Link to google drive folder with model(https://drive.google.com/open?id=1kjb2hXaSaG-Esl_SHJIptTZ9-Gw-er6f) ## Analysis - Tried out different BPE settings and managed some improvements on the baseline. Highest BLEU score I recorded was at BPE 20000. It might be worth exploring different bpe settings for the source and target languages. Results from different settings included in the analysis below. BPE 4000 ```sh BLEU dev: 23.83 BLEU test: 36.06 ``` BPE 5000 ```sh 2020-05-02 16:15:52,007 - dev bleu: 23.90 [Beam search decoding with beam size = 5 and alpha = 1.0] 2020-05-02 16:19:02,219 - test bleu: 36.53 [Beam search decoding with beam size = 5 and alpha = 1.0] ``` BPE 10000 ```sh 2020-05-02 16:24:16,890 - dev bleu: 25.08 [Beam search decoding with beam size = 5 and alpha = 1.0] 2020-05-02 16:27:32,801 - test bleu: 37.39 [Beam search decoding with beam size = 5 and alpha = 1.0] ``` BPE 15000 ```sh 2020-05-02 16:29:11,281 - dev bleu: 25.60 [Beam search decoding with beam size = 5 and alpha = 1.0] 2020-05-02 16:32:22,789 - test bleu: 37.75 [Beam search decoding with beam size = 5 and alpha = 1.0] ``` BPE 25000 ```sh 2020-05-02 16:38:37,910 - dev bleu: 25.13 [Beam search decoding with beam size = 5 and alpha = 1.0] 2020-05-02 16:41:35,004 - test bleu: 37.35 [Beam search decoding with beam size = 5 and alpha = 1.0] ``` BPE 30000 ```sh 2020-05-02 16:43:32,174 - dev bleu: 25.36 [Beam search decoding with beam size = 5 and alpha = 1.0] 2020-05-02 16:46:28,748 - test bleu: 37.35 [Beam search decoding with beam size = 5 and alpha = 1.0] ``` BPE 35000 ```sh 2020-05-02 16:48:32,963 - dev bleu: 25.95 [Beam search decoding with beam size = 5 and alpha = 1.0] 2020-05-02 16:51:30,421 - test bleu: 37.77 [Beam search decoding with beam size = 5 and alpha = 1.0] ``` # Results BPE 20000 ``` 2020-05-02 16:33:57,175 - dev bleu: 25.14 [Beam search decoding with beam size = 5 and alpha = 1.0] 2020-05-02 16:36:53,670 - test bleu: 37.85 [Beam search decoding with beam size = 5 and alpha = 1.0] ```