Update README.md
README.md
CHANGED
@@ -4,4 +4,8 @@ license: agpl-3.0
Model is developed in support of the University of Belgrade doctoral dissertation "Composite pseudogrammars based on parallel language models of Serbian" by Mihailo Škorić.
-It generates syntactically masked sentences for Serbian.
+It generates syntactically masked sentences for Serbian.
+
+This small GPT-2 model was fine-tuned on several Serbian corpora, augmented using [Serbian Morphological Dictionaries](http://poincare.matf.bg.ac.rs/~cvetana/biblio/22_Vitas_Krstev.pdf).
+
+The corpora include ["The corpus of Contemporary Serbian"](https://drive.google.com/file/d/1wRgoWer6YULGCXR0zWOl1fVA6VIe1DOR), [SrpELTeC](https://drive.google.com/file/d/1RtBXyw5Cdh6y_cqbJoMlYhSwNFydBRUv) and WikiKorpus by [JeRTeh – Society for Language Resources and Technologies](https://jerteh.rs/).