Create README.md
Browse files
README.md
ADDED
@@ -0,0 +1,9 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
# KB-BERT for NER
|
2 |
+
|
3 |
+
## Mixed cased and uncased data
|
4 |
+
|
5 |
+
This model is based on [KB-BERT](https://huggingface.co/KB/bert-base-swedish-cased) and was fine-tuned on the [SUC 3.1](https://huggingface.co/datasets/KBLab/suc3_1) corpus, using the _simple_ tags and partially lowercased data.
|
6 |
+
For this model we used a variation of the data that did **not** use BIO-encoding to differentiate between the beginnings (B), and insides (I) of named entity tags.
|
7 |
+
|
8 |
+
The model was trained on the training data only, with the best model chosen by its performance on the validation data.
|
9 |
+
You find more information about the model and the performance on our blog: https://kb-labb.github.io/posts/2022--02-07-suc31
|