Commit 1c86448, committed by patriciacarla
1 Parent(s): 0d4e958

Update README.md

README.md CHANGED
@@ -29,11 +29,11 @@ This model is a multilingual hate speech classifier based on the XLM-R architect
 
 ## Training Data
 
-The model is trained
+The model is trained on a multilingual dataset consisting of Twitter and YouTube comments in EN, IT and SL. The dataset consists of diamond standard data, i.e. an alternative to the gold standard that takes into account the perspectives of multiple annotators. This is particularly useful for highly subjective tasks such as annotating hate speech, where the idea of a single truth may be debatable.
 
 ### Techniques Used
 
-- **Multilingual Training:** The model is trained on datasets in multiple languages, allowing it to generalize well across different
+- **Multilingual Training:** The model is trained on datasets in multiple languages, allowing it to generalize well across different linguistic contexts.
 - **Learning from Disagreement:** The model incorporates techniques to learn from annotator disagreement, improving its ability to handle ambiguous and nuanced cases of hate speech.
 
 ### Hate Speech Classes
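
Note on the change above: the README says the model uses diamond standard data and learns from annotator disagreement, but it does not say how. One common way to do this is to keep every annotator's vote, turn the votes into a probability distribution (a "soft label"), and train against that distribution with cross-entropy instead of a single majority label. The sketch below illustrates that idea only. The label names, the function names `soft_label` and `soft_cross_entropy`, and the choice of loss are assumptions for illustration, not the author's confirmed training procedure.

```python
import math
from collections import Counter

# Hypothetical label set, for illustration only. The model's real classes
# are listed under "Hate Speech Classes" in the README.
LABELS = ["not_hate", "hate"]


def soft_label(votes, labels=LABELS):
    """Convert per-annotator votes into a probability distribution over labels."""
    if not votes:
        raise ValueError("need at least one annotator vote")
    unknown = set(votes) - set(labels)
    if unknown:
        raise ValueError(f"unknown labels: {sorted(unknown)}")
    counts = Counter(votes)
    return [counts[label] / len(votes) for label in labels]


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def soft_cross_entropy(logits, target):
    """Cross-entropy between a target distribution and softmax(logits)."""
    probs = softmax(logits)
    return -sum(t * math.log(p) for t, p in zip(target, probs) if t > 0)


# Three annotators disagree on one comment: two say "hate", one says "not_hate".
votes = ["hate", "not_hate", "hate"]
target = soft_label(votes)  # distribution over LABELS, here 1/3 and 2/3
loss = soft_cross_entropy([0.2, 1.1], target)  # logits are made-up model outputs
```

With a majority vote this comment would be labelled plain "hate" and the dissenting annotator would be discarded. The soft label keeps that disagreement in the training target, so the loss is lowest when the model's predicted distribution matches the annotators' split rather than a one-hot label.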