helem-llm / README.md
menimeni123's picture
latest
180d0f0

DistilBERT for Text Classification

This repository contains a fine-tuned DistilBERT model for text classification. The model is designed to classify text into four categories: SAFE, JAILBREAK, INJECTION, and PHISHING.

Model Details

  • Base model: DistilBERT (distilbert-base-uncased)
  • Task: Sequence Classification
  • Number of labels: 4
  • Labels: SAFE, JAILBREAK, INJECTION, PHISHING

Usage

To use this model, you can leverage the Hugging Face Transformers library: