--- title: Hate Speech Text Classifier emoji: 👁 colorFrom: green colorTo: red sdk: gradio sdk_version: 4.21.0 app_file: app.py pinned: false --- # Monitoring Harmful Text in Online Platforms ## Overview This repository hosts the RandomForest classifier model designed for detecting harmful text AGAINST GROUPS. The model classifies text into one of three categories: "Offensive or Hateful", "Neutral or Ambiguous", and "Not Hate". Achieving an accuracy of 92.5%, this model was developed through the combination of three distinct datasets, ensuring robustness and reliability in varied contexts. It was presented at the prestigious annual Gulf Coast Conference & Expo on AI. Model Details Model Type: RandomForest Classifier Accuracy: 92.5% Labels: 0: Neutral or Ambiguous 1: Not Hate 2: Offensive or Hateful Training Data: Augmented version of [this dataset](TLeonidas/twitter-hate-speech-en-240ksamples) (279k+ rows)