gpt2-srlat-synt / README.md
procesaur's picture
Update README.md
c5bc568
metadata
license: agpl-3.0

Model is developed in support of the University of Belgrade doctoral dissertation "Composite pseudogrammars based on parallel language models of Serbian" by Mihailo Škorić.

It generates syntactly masked sentences for Serbian.

This small gpt-2 model was fine-tuned on several corpora for Serbian, augmented using Serbian Morphological Dictionaries).

The corpora include "The corpus of Contemporary Serbian", SrpELTeC and WikiKorpus by JeRTeh – Society for Language Resources and Technologies.