metadata
license: other
license_name: ihtsdo-and-nlm-licences
license_link: https://www.nlm.nih.gov/databases/umls.html
language:
- nl
- en
library_name: sentence-transformers
tags:
- medical
- biology
pipeline_tag: sentence-similarity
widget:
- source_sentence: bartonellosis
  sentences:
  - kattenkrabziekte
  - wond, kattenkrab
  - door teken overgedragen orbiviruskoorts
  - kattenbont
In-Context Dutch Clinical Embeddings with BioLORD & MedMentions
Do mentions sharing the same text need to have the same embedding? No!
This model supports embedding biomedical entities in both English and Dutch. It also supports in-context embedding of concepts, using the following template:
mention text [SEP] (context: ... a textual example containing mention text and some more text on both sides ...)
It also supports embedding mentions without context, particularly in English.
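As a minimal sketch of how the template above might be applied, the helpers below build the in-context input string and cut a context window around a mention. The function names, the character-based window, and the exact spacing inside the parentheses are assumptions made for illustration. The model identifier is not stated here, so `embed` takes it as a parameter rather than hard-coding one.

```python
from typing import List


def format_in_context(mention: str, context: str) -> str:
    """Build the 'mention [SEP] (context: ...)' input string.

    The spacing inside the parentheses is an assumption based on the
    template shown above.
    """
    return f"{mention} [SEP] (context: {context})"


def context_window(text: str, start: int, end: int, width: int = 50) -> str:
    """Return text[start:end] plus up to `width` characters on each side.

    `start` and `end` are character offsets of the mention in `text`.
    The window size is a hypothetical default, not a documented setting.
    """
    left = max(0, start - width)
    right = min(len(text), end + width)
    return text[left:right].strip()


def embed(texts: List[str], model_id: str):
    """Encode texts with sentence-transformers.

    `model_id` must be supplied by the caller (the Hub id or local path
    of this model). The import is done lazily so that the formatting
    helpers above work without the library installed.
    """
    from sentence_transformers import SentenceTransformer

    model = SentenceTransformer(model_id)
    return model.encode(texts)
```

With these helpers, the same mention can be given different inputs depending on where it occurs: take the offsets of the mention in a sentence, call `context_window`, and pass the result of `format_in_context` to `embed`. A mention without context can be passed to `embed` as plain text.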
References
📖 BioLORD-2023: semantic textual representations fusing large language models and clinical knowledge graph insights
Journal of the American Medical Informatics Association, 2024
François Remy, Kris Demuynck, Thomas Demeester
📖 Annotation-preserving machine translation of English corpora to validate Dutch clinical concept extraction tools
Under review, with a preprint available on medRxiv, 2024
Tom Seinen, Jan Kors, Erik van Mulligen, Peter Rijnbeek