Differences That Matter: Auditing Models for Capability Gap Discovery and Rectification • Paper • 2512.16921 • Published Dec 18, 2025 • 9
Language Models Need Sleep: Learning to Self-Modify and Consolidate Memories • Paper • 2606.03979 • Published 10 days ago • 29
How Fast Should a Model Commit to Supervision? Training Reasoning Models on the Tsallis Loss Continuum • Paper • 2604.25907 • Published Apr 28 • 4
RubricEM: Meta-RL with Rubric-guided Policy Decomposition beyond Verifiable Rewards • Paper • 2605.10899 • Published May 11 • 79