Basic Spacy BioNER pipeline, with a RoBERTa-based model [bsc-bio-ehr-es] (https://huggingface.co/PlanTL-GOB-ES/bsc-bio-ehr-es) and a dataset, CANTEMIST, annotated with tumor morphology entities. For further information, check the [official website](https://temu.bsc.es/cantemist/). Visit our [GitHub repository](https://github.com/PlanTL-GOB-ES/lm-biomedical-clinical-es). This work was funded by the Spanish State Secretariat for Digitalization and Artificial Intelligence (SEDIA) within the framework of the Plan-TL | Feature | Description | | --- | --- | | **Name** | `es_cantemist_ner_trf` | | **Version** | `3.4.0` | | **spaCy** | `>=3.4.0,<3.5.0` | | **Default Pipeline** | `transformer`, `ner` | | **Components** | `transformer`, `ner` | | **Vectors** | 0 keys, 0 unique vectors (0 dimensions) | | **Sources** | n/a | | **License** | `[Apache License, Version 2.0](https://www.apache.org/licenses/LICENSE-2.0)` | | **Author** | [The Text Mining Unit from Barcelona Supercomputing Center.](https://huggingface.co/PlanTL-GOB-ES) | ### Label Scheme
View label scheme (1 labels for 1 components) | Component | Labels | | --- | --- | | **`ner`** | `MORFOLOGIA_NEOPLASIA` |
### Accuracy | Type | Score | | --- | --- | | `ENTS_F` | 84.52 | | `ENTS_P` | 84.88 | | `ENTS_R` | 84.16 | | `TRANSFORMER_LOSS` | 25646.78 | | `NER_LOSS` | 9622.84 |