Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Al-Chan
/
Malicious_Prompt_Classifier
like
0
Text Classification
Transformers
Safetensors
davanstrien/aart-ai-safety-dataset
obalcells/advbench
databricks/databricks-dolly-15k
distilbert
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
Malicious & Jailbreaking Prompt Classifer
Datasets Used
Malicious & Jailbreaking Prompt Classifer
Datasets Used
MaliciousInstruct
AART
StrongREJECT
DAN
AdvBench
Databricks-Dolly
Downloads last month
22
Safetensors
Model size
67M params
Tensor type
F32
·
Files info
Inference Providers
NEW
Text Classification
This model isn't deployed by any Inference Provider.
🙋
Ask for provider support
Datasets used to train
Al-Chan/Malicious_Prompt_Classifier
databricks/databricks-dolly-15k
Viewer
•
Updated
Jun 30, 2023
•
15k
•
16.5k
•
807
davanstrien/aart-ai-safety-dataset
Viewer
•
Updated
Jan 9, 2024
•
3.27k
•
31
•
2
Space using
Al-Chan/Malicious_Prompt_Classifier
1
📚
Al-Chan/Project_Demo