ibraheemmoosa committed
Commit 0aece10
1 parent: 0826f15

Update README.md

Files changed (1): README.md (+6 -4)
@@ -86,7 +86,11 @@ The motivation behind this is this. When two languages share vocabularies, a mac
 
 For many of the scripts currently in use, there are standard transliteration schemes to convert to the Latin script. In particular, for the Indic scripts the ISO-15919 transliteration scheme is designed to consistently transliterate texts written in different Indic scripts to the Latin script.
 
-This model has been trained on ISO-15919 transliterated text of various Indo-Aryan languages.
+An example of ISO-15919 transliteration for a piece of Bangla text is the following:
+
+**Original:** "রবীন্দ্রনাথ ঠাকুর এফআরএএস (৭ মে ১৮৬১ - ৭ আগস্ট ১৯৪১; ২৫ বৈশাখ ১২৬৮ - ২২ শ্রাবণ ১৩৪৮ বঙ্গাব্দ) ছিলেন অগ্রণী বাঙালি কবি, ঔপন্যাসিক, সংগীতস্রষ্টা, নাট্যকার, চিত্রকর, ছোটগল্পকার, প্রাবন্ধিক, অভিনেতা, কণ্ঠশিল্পী ও দার্শনিক।"
+**Transliterated:** 'rabīndranātha ṭhākura ēphaāraēēsa (7 mē 1861 - 7 āgasṭa 1941; 25 baiśākha 1268 - 22 śrābaṇa 1348 baṅgābda) chilēna agraṇī bāṅāli kabi, aupanyāsika, saṁgītasraṣṭā, nāṭyakāra, citrakara, chōṭagalpakāra, prābandhika, abhinētā, kaṇṭhaśilpī ō dārśanika.'
+
 
 ## Training procedure
 
@@ -175,6 +179,4 @@ Even though we pretrain on a comparatively large multilingual corpus the model m
 
 ### BibTeX entry and citation info
 
-Coming soon!
-
-
+Coming soon!