APS Jailbreak
Collection
M/LLMs jailbroken by Adaptive Probe-based Steering. Remember to set trust_remote_code! Only 50 pairs of contrastive prompts were used; you can do better with more. 13 items.
The models achieve StrongREJECT scores of roughly 90% or higher. See the APS paper.
Trained with https://github.com/YuanBoXie/DeepRefusal