patriciacarla committed on
Commit
1c86448
1 Parent(s): 0d4e958

Update README.md

  1. README.md +2 -2
README.md CHANGED
@@ -29,11 +29,11 @@ This model is a multilingual hate speech classifier based on the XLM-R architect
 
  ## Training Data
 
- The model is trained using a multilingual dataset consisting of Twitter and YouTube comments in EN, IT and SL.
+ The model is trained on a multilingual dataset consisting of Twitter and YouTube comments in EN, IT and SL. The dataset consists of diamond standard data, i.e. an alternative to the gold standard that takes into account the perspectives of multiple annotators. This is particularly useful for highly subjective tasks such as annotating hate speech, where the idea of a single truth may be debatable.
 
  ### Techniques Used
 
- - **Multilingual Training:** The model is trained on datasets in multiple languages, allowing it to generalize well across different languages.
+ - **Multilingual Training:** The model is trained on datasets in multiple languages, allowing it to generalize well across different linguistic contexts.
  - **Learning from Disagreement:** The model incorporates techniques to learn from annotator disagreement, improving its ability to handle ambiguous and nuanced cases of hate speech.
 
  ### Hate Speech Classes