PhD Knowledge Not Required: A Reasoning Challenge for Large Language Models Paper • 2502.01584 • Published 24 days ago • 9
Representation Learning in Continuous-Time Dynamic Signed Networks Paper • 2207.03408 • Published Jul 7, 2022
Chain-of-Thought Reasoning is a Policy Improvement Operator Paper • 2309.08589 • Published Sep 15, 2023 • 1
Q-Probe: A Lightweight Approach to Reward Maximization for Language Models Paper • 2402.14688 • Published Feb 22, 2024
NATURAL PLAN: Benchmarking LLMs on Natural Language Planning Paper • 2406.04520 • Published Jun 6, 2024 • 12
LLM Defenses Are Not Robust to Multi-Turn Human Jailbreaks Yet Paper • 2408.15221 • Published Aug 27, 2024