File size: 904 Bytes
c6fb38b 29d077a c6fb38b 6a445c4 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 |
---
language: en
tag: text-classification
datasets:
- twitter
- movies subtitles
---
# bert-based-uncased-hatespeech-movies:
A hatespeech model used to classify text as **normal**, **offensive**, **hatespeech** in Movie subtitles. The model is initially a pre-trained transformer model(bert-based-uncased) which is further trained on Twitter comments which can be normal, offensive and hate to learn the context from social media data. It is then fine-tuned using the movie subtitles dataset.
Please check our paper and if used please cite
```
@article{von2021hateful,
title={How Hateful are Movies? A Study and Prediction on Movie Subtitles},
author={von Boguszewski, Niklas and Moin, Sana and Bhowmick, Anirban and Yimam, Seid Muhie and Biemann, Chris},
journal={arXiv preprint arXiv:2108.10724},
year={2021}
}
```
The dataset and models are available on https://github.com/uhh-lt/hatespeech |