AswiN037 committed
Commit 4c146c1
1 Parent(s): de2ec3b

Update README.md

Files changed (1)
  1. README.md +17 -3
README.md CHANGED
@@ -1,5 +1,19 @@
- tokenizer - BPE 30_522 vocab size
- model - Roberta
+ ---
+ language:
+ - Tamil
+ tags:
+ - Tamil-Tokenizer
+ - Tamil-language-model
+ license: "apache-2.0"
+ datasets:
+ - oscar
+ ---
+
+
+ # tokenizer - BPE 30_522 vocab size
+
+
+ ## model - Roberta
  trained using MLM
  OSCAR dataset
- size 1000 lines
+ train data size 5000 lines only
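
The README says the model was "trained using MLM" but gives no details. As a rough illustration only, here is a minimal sketch of how masked language modeling usually prepares a training example. It assumes the common 15% masking rate with the 80/10/10 split (mask token / random token / unchanged); the README does not confirm these values, and `mask_tokens`, `MASK_TOKEN`, and the toy vocabulary are hypothetical names, not part of this repository.

```python
import random

# Assumption: "<mask>" is the mask token, as is typical for RoBERTa tokenizers.
MASK_TOKEN = "<mask>"


def mask_tokens(tokens, vocab, mask_prob=0.15, seed=0):
    """Build one MLM example: return (inputs, labels) of equal length.

    labels[i] holds the original token where a prediction is required,
    and None where the position is ignored by the loss.
    """
    rng = random.Random(seed)
    inputs, labels = [], []
    for tok in tokens:
        if rng.random() < mask_prob:
            labels.append(tok)  # the model must recover the original token
            roll = rng.random()
            if roll < 0.8:
                inputs.append(MASK_TOKEN)         # 80%: replace with the mask token
            elif roll < 0.9:
                inputs.append(rng.choice(vocab))  # 10%: replace with a random token
            else:
                inputs.append(tok)                # 10%: keep the token unchanged
        else:
            labels.append(None)  # not selected: no loss at this position
            inputs.append(tok)
    return inputs, labels


if __name__ == "__main__":
    # Toy placeholder tokens; real input would be BPE token ids.
    example = ["tok%d" % i for i in range(12)]
    toy_vocab = ["tokA", "tokB", "tokC"]
    masked, targets = mask_tokens(example, toy_vocab, seed=42)
    print(masked)
    print(targets)
```

In practice this step is normally handled by a library data collator rather than written by hand; the sketch only shows what "MLM" means for a single sequence.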