nb-bert-base / README.md
versae's picture
Typo
4f8b165
metadata
language: 'no'
license: CC-BY 4.0
tags:
  - norwegian
  - bert
thumbnail: nblogo_3.png
pipeline_tag: fill-mask
widget:
  - text:  biblioteket kan du låne [MASK].
  • Release 1.1 (March 11, 2021)
  • Release 1.0 (January 13, 2021)

NB-BERT-base

Description

NB-BERT-base is a general BERT-base model built on the large digital collection at the National Library of Norway.

This model is based on the same structure as BERT Cased multilingual model, and is trained on a wide variety of Norwegian text (both bokmål and nynorsk) from the last 200 years.

Intended use & limitations

The 1.1 version of the model is general, and should be fine-tuned for any particular use. Some fine-tuning sets may be found on GitHub, see

Training data

The model is trained on a wide variety of text. The training set is described on

More information

For more information on the model, see

https://github.com/NBAiLab/notram