biosyn-sapbert-bc5cdr-disease

Biomedical Entity Mention Linking for disease:

Demo: How to use in Flair

Requires:

  • Flair>=0.14.0 (pip install flair or pip install git+https://github.com/flairNLP/flair.git)
from flair.data import Sentence
from flair.models import Classifier, EntityMentionLinker
from flair.tokenization import SciSpacyTokenizer

sentence = Sentence(
    "The mutation in the ABCD1 gene causes X-linked adrenoleukodystrophy, "
    "a neurodegenerative disease, which is exacerbated by exposure to high "
    "levels of mercury in dolphin populations.",
    use_tokenizer=SciSpacyTokenizer()
)

# load hunflair to detect the entity mentions we want to link.
tagger = Classifier.load("hunflair-disease")
tagger.predict(sentence)

# load the linker and dictionary
linker = EntityMentionLinker.load("disease-linker")
linker.predict(sentence)

# print the results for each entity mention:
for span in sentence.get_spans(tagger.label_type):
    for link in span.get_labels(linker.label_type):
      print(f"{span.text} -> {link.value}")

As an alternative to downloading the already precomputed model (much storage). You can also build the model and compute the embeddings for the dataset using:

linker = EntityMentionLinker.build("dmis-lab/biosyn-sapbert-bc5cdr-disease", dictionary_name_or_path="ctd-diseases", hybrid_search=True)

This will reduce the download requirements, at the cost of computation.

Downloads last month
141
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.
The model cannot be deployed to the HF Inference API: The model has no pipeline_tag.