File size: 2,177 Bytes
78aa4ee
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
# English to Kikuyu

Author: Kathleen Siminyu

## Data

	- The JW300 English-Kikuyu dataset, 106699 lines.

## Model
  
	- Link to google drive folder with model(https://drive.google.com/open?id=1kjb2hXaSaG-Esl_SHJIptTZ9-Gw-er6f)

## Analysis
	- Tried out different BPE settings and managed some improvements on the baseline. Highest BLEU score I recorded was at BPE 20000. It might be worth exploring different bpe settings for the source 		and target languages. Results from different settings included in the analysis below.

BPE 4000
```sh
	BLEU dev: 23.83
	BLEU test: 36.06
```

BPE 5000
```sh
	2020-05-02 16:15:52,007 -  dev bleu:  23.90 [Beam search decoding with beam size = 5 and alpha = 1.0]
	2020-05-02 16:19:02,219 - test bleu:  36.53 [Beam search decoding with beam size = 5 and alpha = 1.0]
```

BPE 10000
```sh
	2020-05-02 16:24:16,890 -  dev bleu:  25.08 [Beam search decoding with beam size = 5 and alpha = 1.0]
	2020-05-02 16:27:32,801 - test bleu:  37.39 [Beam search decoding with beam size = 5 and alpha = 1.0]
```

BPE 15000
```sh
	2020-05-02 16:29:11,281 -  dev bleu:  25.60 [Beam search decoding with beam size = 5 and alpha = 1.0]
	2020-05-02 16:32:22,789 - test bleu:  37.75 [Beam search decoding with beam size = 5 and alpha = 1.0]
```
BPE 25000
```sh
	2020-05-02 16:38:37,910 -  dev bleu:  25.13 [Beam search decoding with beam size = 5 and alpha = 1.0]
	2020-05-02 16:41:35,004 - test bleu:  37.35 [Beam search decoding with beam size = 5 and alpha = 1.0]
```
BPE 30000
```sh
	2020-05-02 16:43:32,174 -  dev bleu:  25.36 [Beam search decoding with beam size = 5 and alpha = 1.0]
	2020-05-02 16:46:28,748 - test bleu:  37.35 [Beam search decoding with beam size = 5 and alpha = 1.0]
```
BPE 35000
```sh
	2020-05-02 16:48:32,963 -  dev bleu:  25.95 [Beam search decoding with beam size = 5 and alpha = 1.0]
	2020-05-02 16:51:30,421 - test bleu:  37.77 [Beam search decoding with beam size = 5 and alpha = 1.0]
```

# Results
BPE 20000
```
	2020-05-02 16:33:57,175 -  dev bleu:  25.14 [Beam search decoding with beam size = 5 and alpha = 1.0]
	2020-05-02 16:36:53,670 - test bleu:  37.85 [Beam search decoding with beam size = 5 and alpha = 1.0]
```