File size: 1,233 Bytes
3448871
 
 
 
 
d5ee2d3
3448871
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
f2ede09
3448871
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
---
tags:
- Levantine Arabic
- Shami
- English
- Egyptian
---

# English2Dialect

This model generates translations from English to colloquial Arabic, conditioning the translations on dialect. Supported dialects are: Palestinian/Jordanian, Syrian, Lebanese and Egyptian.
Dialect is specified via the first input token which should be either P/S/L/E.

For example, to translate to Palestinian/Jordanian, use the input:

`P What time is it now?`

And for Syrian:

`S What time is it now?`

- **Demo:** https://huggingface.co/spaces/guymorlan/English2Shami
- **Version w/o conditional generation (Levantine only):** https://huggingface.co/guymorlan/English2Shami
<!-- Provide a quick summary of what the model is/does. -->

## Training Data
The model was trained by fine-tuning the opus-mt-ar-en (MSA to English) model on ~85K parallel sentences in four dialects of colloquial Arabic.

## Model Description

<!-- Provide a longer summary of what this model is. -->

- **Developed by:** Guy Mor-Lan (guy.mor@mail.huji.ac.il)
- **Model type:** MarianMT Seq2Seq
- **License:** MIT
- **Finetuned from model:** Helsinki-NLP/opus-mt-ar-en[https://huggingface.co/Helsinki-NLP/opus-mt-ar-en]

<!--- **Repository:** [More Information Needed] -->