metadata
tags:
  - spacy
  - token-classification
language:
  - es
license: gpl-3.0
model-index:
  - name: es_cantemist_ner_trf
    results:
      - task:
          name: NER
          type: token-classification
        metrics:
          - name: NER Precision
            type: precision
            value: 0.8487622923
          - name: NER Recall
            type: recall
            value: 0.8416274378
          - name: NER F Score
            type: f_score
            value: 0.8451798075
widget:
  - text: >-
      JUICIO DIAGNÓSTICO Encefalitis límbica y polineuropatía sensitiva
      paraneoplásicas secundarias a carcinoma microcítico de pulmón cTxN2 M0
      (enfermedad limitada) .

Basic spaCy BioNER pipeline built on a RoBERTa-based model, [bsc-bio-ehr-es](https://huggingface.co/PlanTL-GOB-ES/bsc-bio-ehr-es), and trained on CANTEMIST, a dataset annotated with tumour morphology entities. For further information, check the official website. Visit our GitHub repository. This work was funded by the Spanish State Secretariat for Digitalization and Artificial Intelligence (SEDIA) within the framework of the Plan-TL.

| Feature | Description |
| --- | --- |
| **Name** | `es_cantemist_ner_trf` |
| **Version** | `3.4.0` |
| **spaCy** | `>=3.4.0,<3.5.0` |
| **Default Pipeline** | `transformer`, `ner` |
| **Components** | `transformer`, `ner` |
| **Vectors** | 0 keys, 0 unique vectors (0 dimensions) |
| **Sources** | https://huggingface.co/datasets/PlanTL-GOB-ES/cantemist-ner |
| **License** | [Apache License, Version 2.0](https://www.apache.org/licenses/LICENSE-2.0) |
| **Author** | The Text Mining Unit from Barcelona Supercomputing Center. |
| **Copyright** | Copyright by the Spanish State Secretariat for Digitalization and Artificial Intelligence (SEDIA) (2022) |
| **Funding** | This work was funded by the Spanish State Secretariat for Digitalization and Artificial Intelligence (SEDIA) within the framework of the Plan-TL |
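A minimal usage sketch follows. It assumes the `es_cantemist_ner_trf` package is already installed in an environment with a compatible spaCy version (`>=3.4.0,<3.5.0`) and `spacy-transformers`. The helper names `load_pipeline` and `extract_entities` are illustrative and not part of the model or of spaCy.

```python
def load_pipeline(name="es_cantemist_ner_trf"):
    """Load the installed pipeline package by name.

    spaCy is imported lazily so that the helper below can be used
    (and tested) without spaCy being importable.
    """
    import spacy  # assumption: spaCy >=3.4.0,<3.5.0 is installed

    return spacy.load(name)


def extract_entities(doc, label="MORFOLOGIA_NEOPLASIA"):
    """Return (text, label, start_char, end_char) for entities with the given label."""
    return [
        (ent.text, ent.label_, ent.start_char, ent.end_char)
        for ent in doc.ents
        if ent.label_ == label
    ]


# Usage (requires the model package to be installed):
# nlp = load_pipeline()
# doc = nlp(
#     "JUICIO DIAGNÓSTICO Encefalitis límbica y polineuropatía sensitiva "
#     "paraneoplásicas secundarias a carcinoma microcítico de pulmón cTxN2 M0 "
#     "(enfermedad limitada) ."
# )
# for text, label, start, end in extract_entities(doc):
#     print(text, label, start, end)
```

The character offsets come from spaCy's `Span.start_char` and `Span.end_char`, which makes it straightforward to map predictions back onto the original clinical text.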

Label Scheme

View label scheme (1 label for 1 component)

| Component | Labels |
| --- | --- |
| `ner` | `MORFOLOGIA_NEOPLASIA` |

Accuracy

| Type | Score |
| --- | --- |
| `ENTS_F` | 84.52 |
| `ENTS_P` | 84.88 |
| `ENTS_R` | 84.16 |
| `TRANSFORMER_LOSS` | 25646.78 |
| `NER_LOSS` | 9622.84 |
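As a quick consistency check on the scores, the sketch below recomputes the F-score from the precision and recall reported in the model metadata. It assumes `ENTS_F` is the usual F1, that is, the harmonic mean of `ENTS_P` and `ENTS_R`. The function name `f_score` is illustrative.

```python
def f_score(precision: float, recall: float, beta: float = 1.0) -> float:
    """F-beta score; with beta=1 this is the harmonic mean of precision and recall."""
    if precision == 0.0 and recall == 0.0:
        return 0.0
    b2 = beta * beta
    return (1 + b2) * precision * recall / (b2 * precision + recall)


# Values taken from the model-index metadata of this card.
precision = 0.8487622923
recall = 0.8416274378

f1 = f_score(precision, recall)
print(f"F1 = {f1:.4f}")  # F1 = 0.8452
```

The result agrees with the reported `ENTS_F` of 84.52 once expressed as a percentage.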