sr_ner_tesla_bbmc / README.md
Tanor's picture
Update spaCy pipeline
2dcbbd2 verified
metadata
tags:
  - spacy
  - token-classification
language:
  - sr
license: cc-by-sa-3.0
model-index:
  - name: sr_ner_tesla_bbmc
    results:
      - task:
          name: NER
          type: token-classification
        metrics:
          - name: NER Precision
            type: precision
            value: 0.9463341916
          - name: NER Recall
            type: recall
            value: 0.944797727
          - name: NER F Score
            type: f_score
            value: 0.9455653351

sr_ner_tesla_bbmc is a spaCy model meticulously fine-tuned for Named Entity Recognition in Serbian language texts. This advanced model incorporates a transformer layer based on bert-base-multilingual-cased, enhancing its analytical capabilities. It is proficient in identifying 7 distinct categories of entities: PERS (persons), ROLE (professions), DEMO (demonyms), ORG (organizations), LOC (locations), WORK (artworks), and EVENT (events). Detailed information about these categories is available in the accompanying table. The development of this model has been made possible through the support of the Science Fund of the Republic of Serbia, under grant #7276, for the project 'Text Embeddings - Serbian Language Applications - TESLA'.

Feature Description
Name sr_ner_tesla_bbmc
Version 1.0.0
spaCy >=3.7.2,<3.8.0
Default Pipeline transformer, ner
Components transformer, ner
Vectors 0 keys, 0 unique vectors (0 dimensions)
Sources n/a
License CC BY-SA 3.0
Author Milica Ikonić Nešić, Saša Petalinkar, Mihailo Škorić, Ranka Stanković

Label Scheme

View label scheme (7 labels for 1 components)
Component Labels
ner DEMO, EVENT, LOC, ORG, PERS, ROLE, WORK

Accuracy

Type Score
ENTS_F 94.56
ENTS_P 94.63
ENTS_R 94.48
TRANSFORMER_LOSS 140356.48
NER_LOSS 318152.41