NbAiLab /nb-bert-base

• Release 1.1 (March 11, 2021)
• Release 1.0 (January 13, 2021)

NB-BERT-base

Description

NB-BERT-base is a general BERT-base model built on the large digital collection at the National Library of Norway.

This model is based on the same structure as BERT Cased multilingual model, and is trained on a wide variety of Norwegian text (both bokmål and nynorsk) from the last 200 years.

Intended use & limitations

The 1.1 version of the model is general, and should be fine-tuned for any particular use. Some fine-tuning sets may be found on GitHub, see

Training data

The model is trained on a wide variety of text. The training set is described on

Mask token: [MASK]