Dataset and RMU model weights for LLM Defenses Are Not Robust to Multi-Turn Human Jailbreaks Yet
Scale AI
company
Verified
AI & ML interests
None defined yet.
Recent Activity
Papers
View all Papers
datasets
16
ScaleAI/researchrubrics
Viewer
•
Updated
•
101
•
6
•
1
ScaleAI/swe-oec-claude-expert
Viewer
•
Updated
•
1.27k
•
86
•
1
ScaleAI/VisualToolBench
Viewer
•
Updated
•
1.19k
•
144
•
1
ScaleAI/TutorBench
Viewer
•
Updated
•
1.47k
•
107
ScaleAI/SWE-bench_Pro
Viewer
•
Updated
•
731
•
12.1k
•
35
ScaleAI/BioRiskEval
Viewer
•
Updated
•
156k
•
51
ScaleAI/TutorBench_sample
Viewer
•
Updated
•
30
•
34
ScaleAI/mrt
Updated
•
446
•
3
ScaleAI/stc
Updated
•
7
ScaleAI/fortress_public
Viewer
•
Updated
•
500
•
2.12k
•
2