Cautious Optimizers: Improving Training with One Line of Code Paper • 2411.16085 • Published Nov 25 • 15