POS tagger based on SlovakBERT

This is a part-of-speech (POS) tagger based on SlovakBERT. The model uses the Universal POS tagset (UPOS). It was fine-tuned on the Slovak part of the Universal Dependencies dataset [Zeman 2017], which contains 10k manually annotated Slovak sentences.


The model was evaluated in our paper [Pikuliak et al 2021, Section 4.2]. It achieves 97.84% accuracy.
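
A minimal usage sketch follows. It is not taken from the authors. It assumes the model is published on the Hugging Face Hub as `kinit/slovakbert-pos`, that the `transformers` library is installed, and that the model's labels are UPOS tag names. The helper names `to_pairs` and `tag` and the example sentence are hypothetical. The pipeline is run without aggregation, so predictions are per sub-word token and may not align one-to-one with whole words.

```python
from typing import Dict, List, Tuple

# Assumed Hub identifier for this model.
MODEL_ID = "kinit/slovakbert-pos"


def to_pairs(predictions: List[Dict]) -> List[Tuple[str, str]]:
    """Reduce token-classification pipeline output to (token, label) pairs.

    Without aggregation, each prediction is a dict that includes the keys
    "word" (the sub-word token text) and "entity" (the predicted label).
    """
    return [(p["word"], p["entity"]) for p in predictions]


def tag(sentence: str) -> List[Tuple[str, str]]:
    """Tag one sentence. The model is downloaded on first use."""
    # Imported lazily so that to_pairs can be used without transformers.
    from transformers import pipeline

    tagger = pipeline("token-classification", model=MODEL_ID)
    return to_pairs(tagger(sentence))


if __name__ == "__main__":
    # Hypothetical example sentence ("Today is a nice day.").
    for token, label in tag("Dnes je pekný deň."):
        print(token, label)
```

Aggregation is left off here because the pipeline's grouping strategies are designed for entity spans and can merge neighbouring tokens that share a label, which is usually not wanted for POS tagging.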


  author    = {Mat{\'{u}}{\v{s}} Pikuliak and
               {\v{S}}tefan Grivalsk{\'{y}} and
               Martin Kon{\^{o}}pka and
               Miroslav Bl{\v{s}}t{\'{a}}k and
               Martin Tamajka and
               Viktor Bachrat{\'{y}} and
               Mari{\'{a}}n {\v{S}}imko and
               Pavol Bal{\'{a}}{\v{z}}ik and
               Michal Trnka and
               Filip Uhl{\'{a}}rik},
  title     = {SlovakBERT: Slovak Masked Language Model},
  journal   = {CoRR},
  volume    = {abs/2109.15254},
  year      = {2021},
  url       = {https://arxiv.org/abs/2109.15254},
  eprinttype = {arXiv},
  eprint    = {2109.15254},

Dataset used to train kinit/slovakbert-pos