Danish BERT (version 2, uncased) by Certainly (previously known as BotXO) finetuned for Named Entity Recognition on the DaNE dataset (Hvingelby et al., 2020) by Malte Højmark-Bertelsen.

Humongous amounts of credit needs to go to Certainly (previously known as BotXO), for pretraining the Danish BERT. For data and training details see their GitHub repository or this article. You can also visit their organization page on Hugging Face.

It is both available in TensorFlow and Pytorch format. The original TensorFlow version can be downloaded using this link.

Here is an example on how to load Danish BERT for token classification in PyTorch using the 🤗Transformers library:

from transformers import AutoTokenizer, AutoModelForTokenClassification

tokenizer = AutoTokenizer.from_pretrained("Maltehb/danish-bert-botxo-ner-dane")
model = AutoModelForTokenClassification.from_pretrained("Maltehb/danish-bert-botxo-ner-dane")

References

Danish BERT. (2020). BotXO. https://github.com/botxo/nordic_bert (Original work published 2019)

Hvingelby, R., Pauli, A. B., Barrett, M., Rosted, C., Lidegaard, L. M., & Søgaard, A. (2020). DaNE: A Named Entity Resource for Danish. Proceedings of the 12th Language Resources and Evaluation Conference, 4597–4604. https://www.aclweb.org/anthology/2020.lrec-1.565

Contact

For help or further information feel free to connect with the author Malte Højmark-Bertelsen on hjb@kmd.dk or any of the following platforms:

MalteHB | Twitter MalteHB | LinkedIn MalteHB | Instagram


Downloads last month
266
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Dataset used to train Maltehb/danish-bert-botxo-ner-dane