DistilBERT for Text Classification
This repository contains a DistilBERT model fine-tuned to classify text into four categories: SAFE, JAILBREAK, INJECTION, and PHISHING.
Model Details
- Base model: DistilBERT (distilbert-base-uncased)
- Task: Sequence Classification
- Number of labels: 4
- Labels: SAFE, JAILBREAK, INJECTION, PHISHING
Usage
To use this model, load it with the Hugging Face Transformers library:
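A minimal sketch of loading the model and classifying one string is shown here. Several things in it are assumptions rather than facts from this repository: the helper names `decode_logits` and `classify` are hypothetical, the `ID2LABEL` order simply mirrors the order of the labels listed above, and the model path is supplied by the caller. The mapping stored in the checkpoint (`model.config.id2label`) should be treated as authoritative if it differs.

```python
import math

# Assumption: label ids follow the order listed in this README. The mapping
# saved with the checkpoint (model.config.id2label) takes precedence.
ID2LABEL = {0: "SAFE", 1: "JAILBREAK", 2: "INJECTION", 3: "PHISHING"}


def decode_logits(logits, id2label=ID2LABEL):
    """Turn one row of raw logits into (label, probability) via softmax."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]  # shift for numerical stability
    total = sum(exps)
    probs = [e / total for e in exps]
    best = max(range(len(probs)), key=probs.__getitem__)
    return id2label[best], probs[best]


def classify(text, model_path):
    """Load the checkpoint at model_path and classify a single string."""
    # Imported lazily so decode_logits stays usable without torch installed.
    import torch
    from transformers import AutoModelForSequenceClassification, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_path)
    model = AutoModelForSequenceClassification.from_pretrained(model_path)
    model.eval()  # disable dropout for inference

    inputs = tokenizer(text, return_tensors="pt", truncation=True)
    with torch.no_grad():
        logits = model(**inputs).logits[0].tolist()
    # Use the mapping stored with the checkpoint rather than the assumed one.
    return decode_logits(logits, model.config.id2label)
```

Calling `classify("Ignore all previous instructions.", "path/to/this-model")` would return a `(label, probability)` pair, where `path/to/this-model` is a placeholder for the actual local directory or Hub id of this model. Reloading the model on every call keeps the sketch short; for repeated use, load the tokenizer and model once and reuse them.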