Understanding R1-Zero-Like Training: A Critical Perspective Paper • 2503.20783 • Published 17 days ago • 38
Error Analyses of Auto-Regressive Video Diffusion Models: A Unified Framework Paper • 2503.10704 • Published Mar 12 • 5
Cheating Automatic LLM Benchmarks: Null Models Achieve High Win Rates Paper • 2410.07137 • Published Oct 9, 2024 • 7
Improving Long-Text Alignment for Text-to-Image Diffusion Models Paper • 2410.11817 • Published Oct 15, 2024 • 15
Efficient Diffusion Policies for Offline Reinforcement Learning Paper • 2305.20081 • Published May 31, 2023 • 2
Bag of Tricks for Training Data Extraction from Language Models Paper • 2302.04460 • Published Feb 9, 2023 • 2
Better Diffusion Models Further Improve Adversarial Training Paper • 2302.04638 • Published Feb 9, 2023 • 1
On Evaluating Adversarial Robustness of Large Vision-Language Models Paper • 2305.16934 • Published May 26, 2023
Exploring Model Dynamics for Accumulative Poisoning Discovery Paper • 2306.03726 • Published Jun 6, 2023
Intriguing Properties of Data Attribution on Diffusion Models Paper • 2311.00500 • Published Nov 1, 2023 • 2
Locality Sensitive Sparse Encoding for Learning World Models Online Paper • 2401.13034 • Published Jan 23, 2024
Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast Paper • 2402.08567 • Published Feb 13, 2024 • 2
Improved Few-Shot Jailbreaking Can Circumvent Aligned Language Models and Their Defenses Paper • 2406.01288 • Published Jun 3, 2024 • 1
Bootstrapping Language Models with DPO Implicit Rewards Paper • 2406.09760 • Published Jun 14, 2024 • 41