metadata
tags:
- spacy
- token-classification
language:
- mk
license: cc-by-sa-4.0
model-index:
- name: mk_core_news_sm
results:
- task:
name: NER
type: token-classification
metrics:
- name: NER Precision
type: precision
value: 0.7299912049
- name: NER Recall
type: recall
value: 0.7063829787
- name: NER F Score
type: f_score
value: 0.7179930796
- task:
name: SENTER
type: token-classification
metrics:
- name: SENTER Precision
type: precision
value: 0.6710526316
- name: SENTER Recall
type: recall
value: 0.6623376623
- name: SENTER F Score
type: f_score
value: 0.6666666667
- task:
name: UNLABELED_DEPENDENCIES
type: token-classification
metrics:
- name: Unlabeled Dependencies Accuracy
type: accuracy
value: 0.6307541626
- task:
name: LABELED_DEPENDENCIES
type: token-classification
metrics:
- name: Labeled Dependencies Accuracy
type: accuracy
value: 0.6307541626
Details: https://spacy.io/models/mk#mk_core_news_sm
Macedonian pipeline optimized for CPU. Components: tok2vec, morphologizer, parser, senter, ner, attribute_ruler, lemmatizer.
Feature | Description |
---|---|
Name | mk_core_news_sm |
Version | 3.2.0 |
spaCy | >=3.2.0,<3.3.0 |
Default Pipeline | morphologizer , parser , attribute_ruler , lemmatizer , ner |
Components | morphologizer , parser , senter , attribute_ruler , lemmatizer , ner |
Vectors | 0 keys, 0 unique vectors (0 dimensions) |
Sources | Macedonian Corpus (Damjan Zlatinov, Melanija Gerasimovska, Borijan Georgievski, Marija Todosovska) Macedonian Corpus (Damjan Zlatinov, Melanija Gerasimovska, Borijan Georgievski, Marija Todosovska) Macedonian Corpus (Damjan Zlatinov, Melanija Gerasimovska, Borijan Georgievski, Marija Todosovska) spaCy lookups data (Explosion) |
License | CC BY-SA 4.0 |
Author | Explosion |
Label Scheme
View label scheme (55 labels for 4 components)
Component | Labels |
---|---|
morphologizer |
POS=PROPN , POS=AUX , POS=ADJ , POS=NOUN , POS=ADP , POS=PUNCT , POS=CONJ , POS=NUM , POS=VERB , POS=PRON , POS=ADV , POS=SCONJ , POS=PART , POS=SYM , POS=X , _ , POS=INTJ |
parser |
ROOT , advmod , att , aux , cc , dep , det , dobj , iobj , neg , nsubj , pobj , poss , pozm , pozv , prep , punct , relcl |
senter |
I , S |
ner |
CARDINAL , DATE , EVENT , FAC , GPE , LANGUAGE , LAW , LOC , MONEY , NORP , ORDINAL , ORG , PERCENT , PERSON , PRODUCT , QUANTITY , TIME , WORK_OF_ART |
Accuracy
Type | Score |
---|---|
TOKEN_ACC |
100.00 |
TOKEN_P |
100.00 |
TOKEN_R |
100.00 |
TOKEN_F |
100.00 |
SENTS_P |
67.11 |
SENTS_R |
66.23 |
SENTS_F |
66.67 |
ENTS_P |
73.00 |
ENTS_R |
70.64 |
ENTS_F |
71.80 |
POS_ACC |
91.64 |
DEP_UAS |
63.08 |
DEP_LAS |
47.60 |