Safe/unsafe categories

#4
by python-processing-unit - opened

What categories of prompts does it classify as unsafe?

Your examples included weapons manufacturing, hacking, and self-harm.

All of the prompts in the dataset.

LH-Tech-AI changed discussion status to closed

Sign up or log in to comment