Updated ReadMe metadata
Browse files
README.md
CHANGED
@@ -1,41 +1,46 @@
|
|
1 |
-
|
2 |
-
|
3 |
-
|
4 |
-
|
5 |
-
|
6 |
-
|
7 |
-
|
8 |
-
This
|
9 |
-
|
10 |
-
|
11 |
-
|
12 |
-
|
13 |
-
|
14 |
-
|
15 |
-
|
16 |
-
|
17 |
-
}
|
18 |
-
|
19 |
-
|
20 |
-
|
21 |
-
|
22 |
-
|
23 |
-
```
|
24 |
-
|
25 |
-
|
26 |
-
|
27 |
-
|
28 |
-
|
29 |
-
|
30 |
-
}
|
31 |
-
|
32 |
-
|
33 |
-
|
34 |
-
|
35 |
-
|
36 |
-
|
37 |
-
|
38 |
-
|
39 |
-
|
40 |
-
|
41 |
-
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
language:
|
3 |
+
- mt
|
4 |
+
- en
|
5 |
+
---
|
6 |
+
# Machine Translation (MT) Models
|
7 |
+
|
8 |
+
This repository hosts two machine translation models that translate Maltese sentences to English.
|
9 |
+
|
10 |
+
## Models
|
11 |
+
|
12 |
+
### Model 1:
|
13 |
+
This model is sourced from [Kurt Abela's MSc Thesis repository](https://github.com/kurtabela/MSc-Thesis). If you utilize this model, kindly acknowledge it by citing the following:
|
14 |
+
|
15 |
+
```
|
16 |
+
@mastersthesis{kurt-abela-thesis,
|
17 |
+
title={Machine Translation System for Low Resource Languages},
|
18 |
+
author={Kurt Abela},
|
19 |
+
school={University of Malta},
|
20 |
+
address={Malta},
|
21 |
+
year={2023},
|
22 |
+
}
|
23 |
+
```
|
24 |
+
|
25 |
+
### Model 2:
|
26 |
+
The second model is the machine translation model used for the iWSLT 2023 shared task submission. For citation purposes, please use:
|
27 |
+
|
28 |
+
```
|
29 |
+
@inproceedings{williams2023dfki,
|
30 |
+
title={UM-DFKI Maltese Speech Translation},
|
31 |
+
author={Williams, Aiden and Abela, Kurt and Kumar, Rishu and B{\"a}r, Martin and Billinghurst, Hannah and Micallef, Kurt and Samin, Ahnaf Mozib and DeMarco, Andrea and van der Plas, Lonneke and Borg, Claudia},
|
32 |
+
booktitle={Proceedings of the 20th International Conference on Spoken Language Translation (IWSLT 2023)},
|
33 |
+
pages={433--441},
|
34 |
+
year={2023}
|
35 |
+
}
|
36 |
+
```
|
37 |
+
|
38 |
+
## How to Use
|
39 |
+
|
40 |
+
All models are [Fairseq](https://github.com/facebookresearch/fairseq) models. Each directory contains the best checkpoint (in the ./checkpoints/mt_en directory). The dictionaries are kept in the ./data_out/mt_en directory. The sentencepiece model and vocabulary can be found in the ./models/mt_en directory.
|
41 |
+
|
42 |
+
To use the models, we guide users to the [Fairseq documentation](https://fairseq.readthedocs.io/en/latest/getting_started.html).
|
43 |
+
|
44 |
+
## Contact
|
45 |
+
|
46 |
+
For any inquiries or issues regarding the models, feel free to [contact us](mailto:kurt.abela@um.edu.mt).
|