Overview
This model detects malicious prompts that could be used to generate phishing websites and emails with popular commercial LLMs such as ChatGPT and Gemini. It was obtained by fine-tuning a pre-trained RoBERTa model on a dataset that combines multiple sets of malicious prompts.
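As a rough illustration of how such a classifier might be queried, the sketch below loads a fine-tuned checkpoint and scores a single prompt. Several names here are assumptions rather than details from this card: `MODEL_PATH` is a hypothetical placeholder, the `LABELS` order (index 1 meaning malicious) is not documented and should be checked against the checkpoint's config, and the tokenizer is assumed to be stored alongside the model weights.

```python
import math
from typing import List, Tuple

# Hypothetical placeholder: replace with the model's actual Hub ID or a local path.
MODEL_PATH = "path/to/this-model"

# Assumed label order; this card does not document it.
LABELS = ["benign", "malicious"]


def softmax(logits: List[float]) -> List[float]:
    """Convert raw logits to probabilities, subtracting the max for stability."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def decide(logits: List[float]) -> Tuple[str, float]:
    """Return the most likely label and its probability."""
    probs = softmax(logits)
    best = max(range(len(probs)), key=probs.__getitem__)
    return LABELS[best], probs[best]


def classify_prompt(prompt: str, model_path: str = MODEL_PATH) -> Tuple[str, float]:
    """Score one prompt with a fine-tuned RoBERTa sequence classifier."""
    # Imported lazily so the helpers above work without torch/transformers installed.
    import torch
    from transformers import RobertaForSequenceClassification, RobertaTokenizerFast

    tokenizer = RobertaTokenizerFast.from_pretrained(model_path)
    model = RobertaForSequenceClassification.from_pretrained(model_path)
    model.eval()

    inputs = tokenizer(prompt, return_tensors="pt", truncation=True)
    with torch.no_grad():
        logits = model(**inputs).logits[0].tolist()
    return decide(logits)
```

Calling `classify_prompt("...")` returns a `(label, probability)` pair. If the checkpoint does not ship a tokenizer, loading the RoBERTa-base tokenizer instead is a reasonable fallback, since that is the tokenizer used during training.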
Dataset Details
The training dataset consists of malicious prompts generated by GPT-4.
Training Details
The model was initialized with RobertaForSequenceClassification.from_pretrained, using the RoBERTa-base checkpoint and its matching tokenizer, and then fine-tuned for 10 epochs.
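The setup described above could look roughly like the following sketch. Only the model class, the RoBERTa-base checkpoint, and the 10 epochs come from this card. Everything else is an assumption made for illustration: the use of `Trainer`, the binary label scheme (`num_labels=2`), the batch size, the maximum sequence length, the `PromptDataset` helper, and the output directory name.

```python
from typing import Dict, List


class PromptDataset:
    """Hypothetical map-style dataset pairing tokenized prompts with integer labels."""

    def __init__(self, encodings: Dict[str, List[List[int]]], labels: List[int]):
        self.encodings = encodings
        self.labels = labels

    def __len__(self) -> int:
        return len(self.labels)

    def __getitem__(self, idx: int) -> Dict[str, object]:
        # One example: every tokenizer field for this row plus its label.
        item = {key: values[idx] for key, values in self.encodings.items()}
        item["labels"] = self.labels[idx]
        return item


def fine_tune(texts: List[str], labels: List[int],
              output_dir: str = "roberta-phishing-prompts") -> None:
    """Fine-tune RoBERTa-base for 10 epochs on labeled prompts."""
    # Imported lazily so PromptDataset works without transformers installed.
    from transformers import (
        RobertaForSequenceClassification,
        RobertaTokenizerFast,
        Trainer,
        TrainingArguments,
    )

    tokenizer = RobertaTokenizerFast.from_pretrained("roberta-base")
    # num_labels=2 is an assumption (benign vs. malicious).
    model = RobertaForSequenceClassification.from_pretrained("roberta-base", num_labels=2)

    # Pad to a fixed length so the default collator can batch the examples.
    encodings = tokenizer(texts, truncation=True, padding="max_length", max_length=128)
    train_dataset = PromptDataset(dict(encodings), labels)

    args = TrainingArguments(
        output_dir=output_dir,
        num_train_epochs=10,             # stated in this card
        per_device_train_batch_size=16,  # assumed
    )
    trainer = Trainer(model=model, args=args, train_dataset=train_dataset)
    trainer.train()

    # Save the model and tokenizer together so they can be reloaded from one path.
    trainer.save_model(output_dir)
    tokenizer.save_pretrained(output_dir)
```

Saving the tokenizer next to the weights keeps inference simple, because both can then be loaded from the same directory.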