Logo BioBERTpt

BioBERTpt - Portuguese Clinical and Biomedical BERT

The BioBERTpt - A Portuguese Neural Language Model for Clinical Named Entity Recognition paper contains clinical and biomedical BERT-based models for Portuguese Language, initialized with BERT-Multilingual-Cased & trained on clinical notes and biomedical literature.

This model card describes the BioBERTpt(all) model, a full version with clinical narratives and biomedical literature in Portuguese language.

How to use the model

Load the model via the transformers library:

from transformers import AutoTokenizer, AutoModel
tokenizer = AutoTokenizer.from_pretrained("pucpr/biobertpt-all")
model = AutoModel.from_pretrained("pucpr/biobertpt-all")

More Information

Refer to the original paper, BioBERTpt - A Portuguese Neural Language Model for Clinical Named Entity Recognition for additional details and performance on Portuguese NER tasks.

Questions?

Post a Github issue on the BioBERTpt repo.

New: fine-tune this model in a few clicks by selecting AutoNLP in the "Train" menu!
Downloads last month
523
Hosted inference API
Fill-Mask
Mask token: [MASK]
This model can be loaded on the Inference API on-demand.