SpaCy-tl_dep_web_sm / README.md
metadata
license: apache-2.0
datasets:
  - universal-dependencies/universal_dependencies
language:
  - tl
pipeline_tag: token-classification
library_name: spacy
tags:
  - part-of-speech
  - nlp
  - spacy
  - tagger

Training code: https://github.com/jdoerfler/ar-fa-id-tl-SpaCy-Training

spaCy morphologizer (universal POS tagger) and dependency labeler for Tagalog. Trained on ~12.5k samples from Universal Dependencies' tl_newscrawl-ud-train.conllu and tested on ~1.1k samples from Universal Dependencies' tl_newscrawl-ud-test.conllu, tl_trg-ud-test.conllu, and tl_ugnayan-ud-test.conllu.
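To check sample counts like the ones above, a CoNLL-U file can be read with plain Python. This is a minimal sketch, not the training repo's loader: `read_conllu` is a hypothetical helper, and the two toy sentences in `SAMPLE` are illustrative annotations, not lines taken from the treebanks.

```python
import io


def read_conllu(stream):
    """Yield each sentence as a list of 10-column token rows."""
    sentence = []
    for line in stream:
        line = line.rstrip("\n")
        if not line:
            # A blank line ends the current sentence.
            if sentence:
                yield sentence
                sentence = []
            continue
        if line.startswith("#"):
            # Comment lines carry sentence-level metadata; skip them here.
            continue
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. "1-2") and empty nodes (e.g. "1.1").
        if "-" in cols[0] or "." in cols[0]:
            continue
        sentence.append(cols)
    if sentence:
        yield sentence


# Toy input (assumption: illustrative annotations only).
SAMPLE = (
    "# text = Kumain ang bata.\n"
    "1\tKumain\tkain\tVERB\t_\t_\t0\troot\t_\t_\n"
    "2\tang\tang\tDET\t_\t_\t3\tdet\t_\t_\n"
    "3\tbata\tbata\tNOUN\t_\t_\t1\tnsubj\t_\t_\n"
    "4\t.\t.\tPUNCT\t_\t_\t1\tpunct\t_\t_\n"
    "\n"
    "# text = Umalis siya.\n"
    "1\tUmalis\talis\tVERB\t_\t_\t0\troot\t_\t_\n"
    "2\tsiya\tsiya\tPRON\t_\t_\t1\tnsubj\t_\t_\n"
    "3\t.\t.\tPUNCT\t_\t_\t1\tpunct\t_\t_\n"
)

sentences = list(read_conllu(io.StringIO(SAMPLE)))
token_count = sum(len(s) for s in sentences)
print(len(sentences), "sentences,", token_count, "tokens")
```

For a real file, replace `io.StringIO(SAMPLE)` with `open(path, encoding="utf-8")`, where `path` points at a downloaded `.conllu` file.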

  • uPOS accuracy: 0.9389
  • Dep head accuracy: 0.7851
  • Dep label accuracy: 0.7028
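
As a rough guide to reading these numbers, the sketch below computes the three accuracies as plain token-level fractions over gold and predicted annotations. It assumes gold and predicted tokens are already aligned one to one; spaCy's own scorer may differ in details such as token alignment and which labels are counted. `token_accuracies` and the toy data are hypothetical, not part of this model or its training code.

```python
def token_accuracies(gold, pred):
    """Return (upos_acc, head_acc, label_acc) for aligned token lists.

    Each token is a (upos, head_index, dep_label) tuple.
    """
    if len(gold) != len(pred):
        raise ValueError("gold and pred must have the same number of tokens")
    n = len(gold)
    upos_acc = sum(g[0] == p[0] for g, p in zip(gold, pred)) / n
    head_acc = sum(g[1] == p[1] for g, p in zip(gold, pred)) / n
    label_acc = sum(g[2] == p[2] for g, p in zip(gold, pred)) / n
    return upos_acc, head_acc, label_acc


# Toy example (assumption: made-up annotations, not model output).
gold = [("VERB", 0, "root"), ("DET", 3, "det"),
        ("NOUN", 1, "nsubj"), ("PUNCT", 1, "punct")]
pred = [("VERB", 0, "root"), ("DET", 3, "det"),
        ("NOUN", 1, "obj"), ("PUNCT", 3, "punct")]

upos_acc, head_acc, label_acc = token_accuracies(gold, pred)
print(f"uPOS: {upos_acc:.4f}  head: {head_acc:.4f}  label: {label_acc:.4f}")
```

The one-to-one alignment assumption matters here: if the tokenizer splits text differently from the gold data, tokens have to be aligned before any of these fractions is meaningful.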

This model has tokenization issues, but they don't affect the metrics significantly, which is itself a red flag...