Fabian S

fabian-sp

·

https://fabian-sp.github.io/

AI & ML interests

Optimization for ML

Recent Activity

authored a paper about 2 months ago

The Surprising Agreement Between Convex Optimization Theory and Learning-Rate Scheduling for Large Model Training

commented on a paper about 2 months ago

The Surprising Agreement Between Convex Optimization Theory and Learning-Rate Scheduling for Large Model Training

authored a paper 12 months ago

SGD with Clipping is Secretly Estimating the Median Gradient

Organizations

None yet

fabian-sp's activity

authored a paper about 2 months ago

The Surprising Agreement Between Convex Optimization Theory and Learning-Rate Scheduling for Large Model Training

Paper • 2501.18965 • Published Jan 31 • 7

commented on a paper about 2 months ago

The Surprising Agreement Between Convex Optimization Theory and Learning-Rate Scheduling for Large Model Training

Paper • 2501.18965 • Published Jan 31 • 7

authored a paper 12 months ago

SGD with Clipping is Secretly Estimating the Median Gradient

Paper • 2402.12828 • Published Feb 20, 2024

updated a collection about 1 year ago

Descent Only

Papers, posts and resources related to optimization for ML. • 6 items • Updated Mar 13, 2024

liked a model about 1 year ago

microsoft/SatCLIP-ViT16-L10

Updated Jan 16, 2024 • 3

authored a paper about 1 year ago

MoMo: Momentum Models for Adaptive Learning Rates

Paper • 2305.07583 • Published May 12, 2023