Virus: Harmful Fine-tuning Attack for Large Language Models Bypassing Guardrail Moderation Paper โข 2501.17433 โข Published Jan 29 โข 9
Running on CPU Upgrade 12.7k 12.7k Open LLM Leaderboard ๐ Track, rank and evaluate open LLMs and chatbots