patriciacarla
/

HS-multilingual-DNR

Text Classification

hate-speech-detection

Inference Endpoints

Model card Files Files and versions Community

patriciacarla commited on Aug 2

Commit

1c86448

•

1 Parent(s): 0d4e958

Update README.md

Files changed (1) hide show

README.md +2 -2

README.md CHANGED Viewed

@@ -29,11 +29,11 @@ This model is a multilingual hate speech classifier based on the XLM-R architect
 ## Training Data
-The model is trained using a multilingual dataset consisting of Twitter and YouTube comments in EN, IT and SL.
 ### Techniques Used
-- **Multilingual Training:** The model is trained on datasets in multiple languages, allowing it to generalize well across different languages.
 - **Learning from Disagreement:** The model incorporates techniques to learn from annotator disagreement, improving its ability to handle ambiguous and nuanced cases of hate speech.
 ### Hate Speech Classes

 ## Training Data
+The model is trained on a multilingual dataset consisting of Twitter and YouTube comments in EN, IT and SL. The dataset consists of diamond standard data, i.e. an alternative to the gold standard that takes into account the perspectives of multiple annotators. This is particularly useful for highly subjective tasks such as annotating hate speech, where the idea of a single truth may be debatable.
 ### Techniques Used
+- **Multilingual Training:** The model is trained on datasets in multiple languages, allowing it to generalize well across different linguistic contexts.
 - **Learning from Disagreement:** The model incorporates techniques to learn from annotator disagreement, improving its ability to handle ambiguous and nuanced cases of hate speech.
 ### Hate Speech Classes