ctoraman committed
Commit
8ff6a58
1 Parent(s): 4cb9519

Update README.md

Files changed (1): README.md (+1 −1)
README.md CHANGED
@@ -15,7 +15,7 @@ The pretrained corpus is OSCAR's Turkish split, but it is further filtered and c
 
 Model architecture is similar to bert-medium (8 layers, 8 heads, and 512 hidden size). Tokenization algorithm is Word-level, which means text is split by white space. Vocabulary size is 66.7k.
 
-The details can be found at this paper:
+The details and performance comparisons can be found at this paper:
 https://arxiv.org/abs/2204.08832
 
 The following code can be used for model loading and tokenization, example max length (514) can be changed:
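The word-level tokenization the README describes can be sketched in plain Python: the text is split on white space and each word is looked up in a vocabulary, with unknown words mapped to an [UNK] id. The tiny vocabulary and the `word_level_encode` helper below are illustrative assumptions, not the model's actual 66.7k-entry vocabulary or API.

```python
# Minimal sketch of word-level tokenization (an assumption for illustration):
# split on white space, map each word to a vocabulary id, fall back to [UNK].
vocab = {"[UNK]": 0, "merhaba": 1, "dünya": 2}

def word_level_encode(text: str, vocab: dict) -> list:
    """Split the text on white space and look each word up in the vocabulary."""
    return [vocab.get(word, vocab["[UNK]"]) for word in text.split()]

print(word_level_encode("merhaba dünya", vocab))       # [1, 2]
print(word_level_encode("merhaba yeni dünya", vocab))  # [1, 0, 2] — "yeni" is unknown
```

In practice the model's tokenizer would be loaded through the Hugging Face hub rather than built by hand; this sketch only shows why a whitespace-split vocabulary stays interpretable but needs a large vocabulary (66.7k entries here) to keep the [UNK] rate low.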