English2Dialect / README.md
guymorlan's picture
Update README.md
f2ede09
---
tags:
- Levantine Arabic
- Shami
- English
---
# English2Dialect
This model generates translations from English to colloquial Arabic, conditioning the translations on dialect. Supported dialects are: Palestinian/Jordanian, Syrian, Lebanese and Egyptian.
Dialect is specified via the first input token which should be either P/S/L/E.
For example, to translate to Palestinian/Jordanian, use the input:
`P What time is it now?`
And for Syrian:
`S What time is it now?`
- **Demo:** https://huggingface.co/spaces/guymorlan/English2Shami
- **Version w/o conditional generation (Levantine only):** https://huggingface.co/guymorlan/English2Shami
<!-- Provide a quick summary of what the model is/does. -->
## Training Data
The model was trained by fine-tuning the opus-mt-ar-en (MSA to English) model on ~85K parallel sentences in four dialects of colloquial Arabic.
## Model Description
<!-- Provide a longer summary of what this model is. -->
- **Developed by:** Guy Mor-Lan (guy.mor@mail.huji.ac.il)
- **Model type:** MarianMT Seq2Seq
- **License:** MIT
- **Finetuned from model:** Helsinki-NLP/opus-mt-ar-en[https://huggingface.co/Helsinki-NLP/opus-mt-ar-en]
<!--- **Repository:** [More Information Needed] -->