AutoDAN-Turbo: A Lifelong Agent for Strategy Self-Exploration to Jailbreak LLMs Paper • 2410.05295 • Published Oct 3, 2024 • 12
JailBreakV-28K: A Benchmark for Assessing the Robustness of MultiModal Large Language Models against Jailbreak Attacks Paper • 2404.03027 • Published Apr 3, 2024 • 3
MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding Paper • 2406.09411 • Published Jun 13, 2024 • 20
AutoDAN: Generating Stealthy Jailbreak Prompts on Aligned Large Language Models Paper • 2310.04451 • Published Oct 3, 2023
JailBreakV-28K: A Benchmark for Assessing the Robustness of MultiModal Large Language Models against Jailbreak Attacks Paper • 2404.03027 • Published Apr 3, 2024 • 3
Running on CPU Upgrade 12.5k 12.5k Open LLM Leaderboard 🏆 Track, rank and evaluate open LLMs and chatbots