cointegrated
commited on
Commit
•
c06cf87
1
Parent(s):
d0216ba
Create README.md
Browse files
README.md
ADDED
@@ -0,0 +1,19 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
|
2 |
+
|
3 |
+
|
4 |
+
---
|
5 |
+
language: ["ru", "en"]
|
6 |
+
tags:
|
7 |
+
- russian
|
8 |
+
- fill-mask
|
9 |
+
- pretraining
|
10 |
+
- embeddings
|
11 |
+
license: mit
|
12 |
+
widget:
|
13 |
+
- text: "Я [MASK] в 1992 году."
|
14 |
+
---
|
15 |
+
This is a very small distilled version of the [bert-base-multilingual-cased](https://huggingface.co/bert-base-multilingual-cased) model for Russian and English.
|
16 |
+
|
17 |
+
This model is useful if you want to fine-tune it for a relatively simple Russian task (e.g. NER or sentiment classification), and you care more about speed and size than about accuracy. It is approximately x10 faster than [DeepPavlov/rubert-base-cased-sentence](https://huggingface.co/DeepPavlov/rubert-base-cased-sentence).
|
18 |
+
|
19 |
+
It was trained on the [Yandex translate corpus](https://translate.yandex.ru/corpus) using MLM loss (partially distilled from `bert-base-multilingual-cased`) and translation ranking loss.
|