File size: 984 Bytes
5d8d9f5
 
 
6ec0fe1
5d8d9f5
6ec0fe1
5d8d9f5
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1c29034
 
5d8d9f5
 
 
 
6ec0fe1
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
---
language:
- da
license: apache-2.0
widget:
- text: Senile gamle idiot
---

# Danish BERT for hate speech (offensive language) detection

The BERT HateSpeech model detects whether a Danish text is offensive or not. 
It is based on the pretrained [Danish BERT](https://github.com/certainlyio/nordic_bert) model by BotXO which has been fine-tuned on social media data. 

See the [DaNLP documentation](https://danlp-alexandra.readthedocs.io/en/latest/docs/tasks/hatespeech.html#bertdr) for more details. 


Here is how to use the model:

```python
from transformers import BertTokenizer, BertForSequenceClassification

model = BertForSequenceClassification.from_pretrained("alexandrainst/da-hatespeech-detection-base")
tokenizer = BertTokenizer.from_pretrained("alexandrainst/da-hatespeech-detection-base")
```

## Training data

The data used for training has not been made publicly available. It consists of social media data manually annotated in collaboration with Danmarks Radio.