QCRI
/

Translation
Safetensors
m2m_100
Commit 46d75bb (verified) by BaselMousi · 1 parent: cf48289

Update README.md

Files changed (1)
  1. README.md +26 -3
README.md CHANGED
@@ -1,3 +1,26 @@
- ---
- license: cc-by-sa-4.0
- ---
+ ---
+ license: cc-by-sa-4.0
+ base_model:
+ - facebook/nllb-200-3.3B
+ pipeline_tag: translation
+ ---
+ # AraDiCE-msa-to-lev: An MSA to Levantine Machine Translation Model Based on NLLB-3.3B
+
+ This repository includes an MSA-to-LEV machine translation model. This model was used to curate dialectal benchmarks. The human post-edited benchmarks can be found <a href="https://huggingface.co/datasets/QCRI/AraDiCE" target="_blank" style="margin-right: 15px; margin-left: 10px">here.</a>
+
+ ## License
+
+ The model is distributed under the **Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License (CC BY-NC-SA 4.0)**. The full license text can be found in the accompanying `licenses_by-nc-sa_4.0_legalcode.txt` file.
+
+ ## Citation
+ Please find the paper <a href="https://arxiv.org/pdf/2409.11404" target="_blank" style="margin-right: 15px; margin-left: 10px">here.</a>
+
+ ```
+ @article{mousi2024aradicebenchmarksdialectalcultural,
+ title={{AraDiCE}: Benchmarks for Dialectal and Cultural Capabilities in LLMs},
+ author={Basel Mousi and Nadir Durrani and Fatema Ahmad and Md. Arid Hasan and Maram Hasanain and Tameem Kabbani and Fahim Dalvi and Shammur Absar Chowdhury and Firoj Alam},
+ year={2024},
+ publisher={arXiv:2409.11404},
+ url={https://arxiv.org/abs/2409.11404},
+ }
+ ```