yuxudong's picture

1

yuxudong

xudong20

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 2 months ago

Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

authored a paper over 1 year ago

Towards Robust Offline-to-Online Reinforcement Learning via Uncertainty and Smoothness

View all activity

Organizations

xudong20's activity

upvoted a paper about 2 months ago

Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

Paper • 2502.06703 • Published Feb 10 • 149

authored a paper over 1 year ago

Towards Robust Offline-to-Online Reinforcement Learning via Uncertainty and Smoothness

Paper • 2309.16973 • Published Sep 29, 2023