rmihaylov committed
Commit 985c9aa
1 Parent(s): 5d8623a

Update README.md

Files changed (1): README.md +7 -1
README.md CHANGED
@@ -16,7 +16,13 @@ tags:
 Pretrained model on Bulgarian language using a masked language modeling (MLM) objective. It was introduced in
 [this paper](https://arxiv.org/abs/1810.04805) and first released in
 [this repository](https://github.com/google-research/bert). This model is cased: it does make a difference
-between bulgarian and Bulgarian. The training data is Bulgarian text from [OSCAR](https://oscar-corpus.com/post/oscar-2019/), [Chitanka](https://chitanka.info/) and [Wikipedia](https://bg.wikipedia.org/).
+between bulgarian and Bulgarian.
+
+## Model description
+
+The model was trained similarly to [RuBERT](https://arxiv.org/pdf/1905.07213.pdf), in which Multilingual BERT was adapted for the Russian language.
+
+The training data was Bulgarian text from [OSCAR](https://oscar-corpus.com/post/oscar-2019/), [Chitanka](https://chitanka.info/) and [Wikipedia](https://bg.wikipedia.org/).
 
 ### How to use
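
For readers curious what the RuBERT-style adaptation mentioned in the new "Model description" section looks like in practice, below is a minimal sketch, not the authors' training code: it initializes a monolingual BERT from `bert-base-multilingual-cased` by copying the embeddings of every WordPiece token shared with a new Bulgarian vocabulary. The local tokenizer path `./bg-wordpiece` is a hypothetical stand-in for a tokenizer trained beforehand on the Bulgarian corpora listed above.

```python
# Minimal sketch of a RuBERT-style vocabulary transfer (an assumption about
# the method, not the authors' published code). Requires: transformers, torch.
import torch
from transformers import BertForMaskedLM, BertTokenizerFast

# Start from the multilingual checkpoint, as in the RuBERT paper.
model = BertForMaskedLM.from_pretrained("bert-base-multilingual-cased")
mtok = BertTokenizerFast.from_pretrained("bert-base-multilingual-cased")

# Hypothetical WordPiece tokenizer trained beforehand on Bulgarian text.
bg_tok = BertTokenizerFast.from_pretrained("./bg-wordpiece")

old_emb = model.get_input_embeddings().weight.data
hidden = old_emb.size(1)

# New embedding matrix: random init (BERT's 0.02 std) for unseen tokens,
# copied multilingual vectors for tokens present in both vocabularies.
new_emb = torch.empty(len(bg_tok), hidden).normal_(mean=0.0, std=0.02)
mvocab = mtok.get_vocab()
for token, new_id in bg_tok.get_vocab().items():
    old_id = mvocab.get(token)
    if old_id is not None:
        new_emb[new_id] = old_emb[old_id]

# Swap the embeddings in (resizing also updates the tied MLM output layer),
# then continue MLM pretraining on the Bulgarian corpora.
model.resize_token_embeddings(len(bg_tok))
model.get_input_embeddings().weight.data.copy_(new_emb)
```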
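
Since the diff leaves the `### How to use` section itself unchanged, here is a hedged fill-mask sketch of what usage typically looks like for such a model. The model ID `rmihaylov/bert-base-bg` is an assumption for illustration; substitute the actual repository name.

```python
# Usage sketch: the model ID below is an assumption, not confirmed by this
# diff. Replace it with the real Hugging Face repository name.
from transformers import pipeline

fill_mask = pipeline("fill-mask", model="rmihaylov/bert-base-bg")  # hypothetical ID

# The model is cased, so capitalized and lowercase forms tokenize differently.
for pred in fill_mask("Столицата на България е [MASK]."):
    print(pred["token_str"], round(pred["score"], 3))
```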