ptaszynski
commited on
Commit
•
5663327
1
Parent(s):
0bc3258
Update README.md
Browse files
README.md
CHANGED
@@ -1,16 +1,12 @@
|
|
1 |
---
|
2 |
-
|
3 |
-
|
4 |
-
license: cc-by-sa-4.0
|
5 |
-
|
6 |
datasets:
|
7 |
-
|
8 |
-
|
9 |
-
-
|
10 |
-
|
11 |
-
-
|
12 |
-
-
|
13 |
-
|
14 |
---
|
15 |
|
16 |
# Polbert-CB - Polish BERT trained for Automatic Cyberbullying Detection
|
@@ -65,11 +61,23 @@ Original dataset:
|
|
65 |
|
66 |
Improved dataset:
|
67 |
|
|
|
|
|
|
|
68 |
```
|
69 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
70 |
```
|
71 |
|
72 |
## References
|
73 |
* https://github.com/google-research/bert
|
74 |
* https://github.com/ptaszynski/cyberbullying-Polish
|
75 |
-
* https://huggingface.co/datasets/poleval2019_cyberbullying
|
|
|
1 |
---
|
2 |
+
license: cc-by-4.0
|
|
|
|
|
|
|
3 |
datasets:
|
4 |
+
- ptaszynski/PolishCyberbullyingDataset
|
5 |
+
language:
|
6 |
+
- pl
|
7 |
+
tags:
|
8 |
+
- cyberbullying
|
9 |
+
- hate-speech
|
|
|
10 |
---
|
11 |
|
12 |
# Polbert-CB - Polish BERT trained for Automatic Cyberbullying Detection
|
|
|
61 |
|
62 |
Improved dataset:
|
63 |
|
64 |
+
The improved dataset used for training this model was released as follows.
|
65 |
+
[Expert-annotated dataset to study cyberbullying in Polish language](https://huggingface.co/datasets/ptaszynski/PolishCyberbullyingDataset)
|
66 |
+
|
67 |
```
|
68 |
+
@article{ptaszynski2023expert,
|
69 |
+
title={Expert-Annotated Dataset to Study Cyberbullying in Polish Language},
|
70 |
+
author={Ptaszynski, Michal and Pieciukiewicz, Agata and Dybala, Pawel and Skrzek, Pawel and Soliwoda, Kamil and Fortuna, Marcin and Leliwa, Gniewosz and Wroczynski, Michal},
|
71 |
+
journal={Data},
|
72 |
+
volume={9},
|
73 |
+
number={1},
|
74 |
+
pages={1},
|
75 |
+
year={2023},
|
76 |
+
publisher={MDPI}
|
77 |
+
}
|
78 |
```
|
79 |
|
80 |
## References
|
81 |
* https://github.com/google-research/bert
|
82 |
* https://github.com/ptaszynski/cyberbullying-Polish
|
83 |
+
* https://huggingface.co/datasets/poleval2019_cyberbullying
|