Basic spaCy BioNER pipeline with a RoBERTa-based model, [bsc-bio-ehr-es](https://huggingface.co/PlanTL-GOB-ES/bsc-bio-ehr-es), and PharmaCoNER, a NER dataset annotated with substance, compound and protein entities. For further information, check the official website. Visit our GitHub repository. This work was funded by the Spanish State Secretariat for Digitalization and Artificial Intelligence (SEDIA) within the framework of the Plan-TL.
Label Scheme
View label scheme (4 labels for 1 component)

| Component | Labels |
| --- | --- |
| ner | NORMALIZABLES, NO_NORMALIZABLES, PROTEINAS, UNCLEAR |
Accuracy
| Type | Score |
| --- | --- |
| ENTS_F | 91.09 |
| ENTS_P | 90.67 |
| ENTS_R | 91.53 |
| TRANSFORMER_LOSS | 15719.51 |
| NER_LOSS | 22469.88 |
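
The three entity scores are related: ENTS_F is conventionally the F1 score, the harmonic mean of precision (ENTS_P) and recall (ENTS_R). The sketch below recomputes it from the reported values as a sanity check. The helper name `f1_score` is illustrative and not part of this pipeline, and the tolerance is an assumption that accounts for the inputs being rounded to two decimals.

```python
def f1_score(precision: float, recall: float) -> float:
    """Return the harmonic mean of precision and recall.

    Inputs and output share the same scale (here, percentages).
    """
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Scores copied from the Accuracy table (percentages).
ENTS_P = 90.67
ENTS_R = 91.53
ENTS_F = 91.09

recomputed = f1_score(ENTS_P, ENTS_R)

# The reported values are rounded, so compare with a small tolerance
# (0.02 is an assumed margin, not a value from the model card).
assert abs(recomputed - ENTS_F) < 0.02
print(f"F1 recomputed from P and R: {recomputed:.2f}")
```

The two loss values are cumulative training losses and are not on the same scale as the entity scores, so they are left out of this check.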