Update README.md
Browse files
README.md
CHANGED
@@ -6,4 +6,8 @@ language:
|
|
6 |
|
7 |
Model is developed in support of the University of Belgrade doctoral dissertation "Composite pseudogrammars based on parallel language models of Serbian" by Mihailo Škorić.
|
8 |
|
9 |
-
It generates semantically masked (lemmatized and without stopwords) sentences for Serbian.
|
|
|
|
|
|
|
|
|
|
6 |
|
7 |
Model is developed in support of the University of Belgrade doctoral dissertation "Composite pseudogrammars based on parallel language models of Serbian" by Mihailo Škorić.
|
8 |
|
9 |
+
It generates semantically masked (lemmatized and without stopwords) sentences for Serbian.
|
10 |
+
|
11 |
+
This small gpt-2 model was fine-tuned on several corpora for Serbian (augmented using Latent semantic analysis methods).
|
12 |
+
|
13 |
+
The corpora include ["The corpus of Contemporary Serbian"](https://drive.google.com/file/d/1wRgoWer6YULGCXR0zWOl1fVA6VIe1DOR), [SrpELTeC](https://drive.google.com/file/d/1RtBXyw5Cdh6y_cqbJoMlYhSwNFydBRUv) and WikiKorpus by [JeRTeh – Society for Language Resources and Technologies](https://jerteh.rs/).
|