edaiofficial's picture
additional commits for edo and urhobo
a2c106c

English to Urhobo

Author: iroro orife

Data

- The JW300 English-Urhobo dataset.

Model

Analysis

The dataset requires more preprocessing to remove special characters and Scripture chapters/verse names & figures. This will make the model more generally useful outside of religious text translations.

Example 1

    Source:     But freedom from what ?
    Reference:  Ẹkẹvuọvo , ẹdia vọ yen egbomọphẹ na che si ayen nu ?
    Hypothesis: ( 1 Pita 3 : 1 ) Ẹkẹvuọvo , die yen egbomọphẹ 

Example 2

    Source:     Today he is serving at Bethel .
    Reference:  Nonẹna , ọ ga vwẹ Bẹtẹl .
    Hypothesis: Nonẹna , ọ ga vwẹ Bẹtẹl asaọkiephana .

Results

Tokenization BLEU dev BLEU test
BPE 15.91 28.82
Word-level 11.80 22.39