cointegrated commited on
Commit
c06cf87
1 Parent(s): d0216ba

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +19 -0
README.md ADDED
@@ -0,0 +1,19 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+
2
+
3
+
4
+ ---
5
+ language: ["ru", "en"]
6
+ tags:
7
+ - russian
8
+ - fill-mask
9
+ - pretraining
10
+ - embeddings
11
+ license: mit
12
+ widget:
13
+ - text: "Я [MASK] в 1992 году."
14
+ ---
15
+ This is a very small distilled version of the [bert-base-multilingual-cased](https://huggingface.co/bert-base-multilingual-cased) model for Russian and English.
16
+
17
+ This model is useful if you want to fine-tune it for a relatively simple Russian task (e.g. NER or sentiment classification), and you care more about speed and size than about accuracy. It is approximately x10 faster than [DeepPavlov/rubert-base-cased-sentence](https://huggingface.co/DeepPavlov/rubert-base-cased-sentence).
18
+
19
+ It was trained on the [Yandex translate corpus](https://translate.yandex.ru/corpus) using MLM loss (partially distilled from `bert-base-multilingual-cased`) and translation ranking loss.