SORRY-Bench

https://sorry-bench.github.io

sorry-bench

Organization Card

SORRY-Bench: Systematically Evaluating LLM Safety Refusal Behaviors

models 1

sorry-bench/ft-mistral-7b-instruct-v0.2-sorry-bench-202406

Text Generation • Updated Jul 2, 2024 • 6.02k • 4

datasets 2

sorry-bench/sorry-bench-human-judgment-202406

Viewer • Updated Jul 2, 2024 • 7.2k • 57 • 5

sorry-bench/sorry-bench-202406

Viewer • Updated Jul 2, 2024 • 9.45k • 711 • 18