metadata
license: apache-2.0
datasets:
- universal-dependencies/universal_dependencies
language:
- tl
pipeline_tag: token-classification
library_name: spacy
tags:
- part-of-speech
- nlp
- spacy
- tagger
https://github.com/jdoerfler/ar-fa-id-tl-SpaCy-Training
SpaCy morphologizer (universal POS tagger) and dependency labeler. Trained on ~12.5k samples from Universal Dependencies' tl_newscrawl-ud-train.conllu, tested on ~1.1k samples Universal Dependencies' tl_newscrawl-ud-test.conllu, tl_trg-ud-test.conllu, and tl_ugnayan-ud-test.conllu
- uPOS accuracy: 0.9389
- Dep head accuracy: 0.7851
- Dep label accuracy: 0.7028
Having tokenizing issues with this one but it doesn't affect the metrics significantly, which is a red flag...