ibraheemmoosa commited on
Commit
33426be
1 Parent(s): 120256b

Add basic tags and description to the README

Browse files
Files changed (1) hide show
  1. README.md +32 -1
README.md CHANGED
@@ -1 +1,32 @@
1
- Hello World.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language:
3
+ - as
4
+ - bn
5
+ - gu
6
+ - hi
7
+ - mr
8
+ - ne
9
+ - or
10
+ - pa
11
+ - si
12
+ license: apache-2.0
13
+ datasets:
14
+ - oscar
15
+ tags:
16
+ - multilingual
17
+ - albert
18
+ - masked-language-modeling
19
+ - sentence-order-prediction
20
+ - fill-mask
21
+ - nlp
22
+ ---
23
+
24
+ # XLMIndic Base Uniscript
25
+
26
+ Pretrained ALBERT model on the OSCAR corpus on the languages Assamese, Bengali, Gujarati, Hindi, Marathi,
27
+ Nepali, Oriya, Panjabi and Sinhala. Like ALBERT it was pretrained using as masked language modeling (MLM)
28
+ and a sentence order prediction (SOP) objective. This model was pretrained after transliterating the text
29
+ to ISO-15919 format using the Aksharamukha library. A demo of Aksharamukha library is hosted [here](https://aksharamukha.appspot.com/converter)
30
+ where you can transliterate your text and use it on our model on the inference widget.
31
+
32
+