---
title: Hate Speech Text Classifier
emoji: πŸ‘
colorFrom: green
colorTo: red
sdk: gradio
sdk_version: 4.21.0
app_file: app.py
pinned: false
---
# Monitoring Harmful Text in Online Platforms
## Overview
This repository hosts a RandomForest classifier for detecting harmful text directed against groups.
The model assigns each text to one of three categories: "Offensive or Hateful", "Neutral or Ambiguous", or "Not Hate".
It reaches an accuracy of 92.5% and was trained on a combination of three distinct datasets, which is intended to make it more robust across varied contexts.
It was presented at the annual Gulf Coast Conference & Expo on AI.
## Model Details
- Model Type: RandomForest Classifier
- Accuracy: 92.5%
- Labels:
  - 0: Neutral or Ambiguous
  - 1: Not Hate
  - 2: Offensive or Hateful
- Training Data: Augmented version of [this dataset](TLeonidas/twitter-hate-speech-en-240ksamples) (279k+ rows)
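
The label ids listed above can be turned back into readable category names with a small lookup. The following is a minimal sketch, not the code used in `app.py`: the names `ID_TO_LABEL` and `decode_predictions` are hypothetical and introduced only for illustration, and it assumes the classifier returns the integer ids 0, 1, and 2 exactly as documented in this README.

```python
# Label ids as listed under "Labels" in this README.
ID_TO_LABEL = {
    0: "Neutral or Ambiguous",
    1: "Not Hate",
    2: "Offensive or Hateful",
}


def decode_predictions(class_ids):
    """Map integer class ids (hypothetical helper) to their label names."""
    return [ID_TO_LABEL[int(class_id)] for class_id in class_ids]


# Example with hand-written ids standing in for real model output.
print(decode_predictions([2, 0, 1]))
```

With a fitted scikit-learn style classifier, the ids passed in would typically come from its `predict` method. How the input text is vectorized beforehand is not described here, so that step is left out of the sketch.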