DavidAU/OpenAi-GPT-oss-20b-abliterated-uncensored-NEO-Imatrix-gguf Text Generation • 21B • Updated 24 days ago • 127k • 338
gpt-oss-safeguard Collection gpt-oss-safeguard-120b and gpt-oss-safeguard-20b are safety reasoning models built-upon gpt-oss • 2 items • Updated Oct 29 • 58
The Attacker Moves Second: Stronger Adaptive Attacks Bypass Defenses Against Llm Jailbreaks and Prompt Injections Paper • 2510.09023 • Published Oct 10 • 9
When Good Sounds Go Adversarial: Jailbreaking Audio-Language Models with Benign Inputs Paper • 2508.03365 • Published Aug 5 • 4 • 2
MemeSafetyBench Collection [EMNLP'25] A Benchmark for Assessing VLM Safety with Real-World Memes • 2 items • Updated Sep 9