Model Details

Classifier-Bias-TahniatKhan is a prototype model crafted to classify content into two categories: "Biased" and "Non-Biased".

Model Architecture

The model is built upon the distilbert-base-uncased architecture and has been fine-tuned on a custom dataset for the specific task of bias detection.

Dataset

The model was trained on a BABE dataset containing news articles from various sources, annotated with one of the 2 bias levels. Biased_Text = 1810 UnBiased_Test=1810

Training Procedure

The model was trained using the Adam optimizer for 6 epochs. Performance On our validation set, the model achieved: Accuracy: 78% F1 Score (Biased): 79% F1 Score (Non-Biased): 78%

How to Use

To use this model for text classification, use the following code

from transformers import pipeline
from transformers import AutoTokenizer, AutoModelForSequenceClassification
 
tokenizer = AutoTokenizer.from_pretrained("/Social-Media-Fairness/Tahniat-Classifier")
model = AutoModelForSequenceClassification.from_pretrained("Social-Media-Fairness/Tahniat-Classifier")
classifier = pipeline("text-classification", model=model, tokenizer=tokenizer)
result = classifier("you are stupid")
print(result)

Caveats and Limitations

The model's training data originates from a specific dataset (BABE) which might not represent all kinds of biases or content. The performance metrics are based on a random validation split, so the model's performance might vary in real-world applications.

Developed by Tahniat Khan

Downloads last month
13
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.