File size: 2,095 Bytes
20a72dc
 
 
 
 
 
 
 
5e994ef
20a72dc
 
 
 
 
 
 
 
 
5e994ef
20a72dc
0fa32cc
20a72dc
0fa32cc
20a72dc
 
 
 
afa4e55
 
 
8edcce0
 
 
b21f28c
8edcce0
 
 
 
 
9584e8b
 
8edcce0
 
 
 
 
d648dca
 
 
 
 
 
 
8edcce0
 
d528f10
 
 
379fc80
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
---
language: african-languages
tags:
- african-languages
- machine-translation
- text
license: apache-2.0
model-index:
- name: Masakhane Benchmark Models
  results:
  - task: 
      name: Machine Translation
      type: machine-translation
    dataset:
      name: masakhane benchmarks
      args: african-languages
   
---
# Interacting with the Masakhane Benchmark Models

I created this demo for very easy interaction with the [benchmark models on Masakhane](https://github.com/masakhane-io/masakhane-mt/tree/master/benchmarks) which were trained with [JoeyNMT](https://github.com/chrisemezue/joeynmt)(my forked version).

To access the space click [here](https://huggingface.co/spaces/chrisjay/masakhane-benchmarks).

To include your language, all you need to do is:
1. Create a folder in the format *src-tgt/main* for your language pair, if it does not exist.
2. Inside the *main* folder put the following files:
    1. model checkpoint. Rename it to `best.ckpt`.
    2. `config.yaml` file. This is the JoeyNMT config file which loads the model an pre-processing parameters.
    3. `src_vocab.txt` file.
    4. `trg_vocab.txt` file.
    
The space currently supports these languages:

| source language | target language |
|:---------------:|:---------------:|
| English         | Swahili         |
| English         | Afrikaans       |
| English         | Arabic          |
| English         | Urhobo          |
| English         | Ẹ̀dó             |
| Efik            | English         |
| English         | Hausa           |
| English         | Igbo            |
| English         | Fon             |
| English         | Twi             |
| English         | Dendi           |
| English         | Ẹ̀sán             |
| English         | Isoko           |
| English         | Kamba           |
| English         | Luo           |
| English         | Southern Ndebele  |
| English         | Tshivenda           |
| Shona           | English         |
| Swahili         | English         |
| Yoruba          | English         |

TO DO:
1. Include more languages from the benchmark.