YUYI YANG

yyuyi

AI & ML interests

None yet

Recent Activity

upvoted a paper about 23 hours ago

Small RL Controller, Large Language Model: RL-Guided Adaptive Sampling for Test-Time Scaling

upvoted a paper 14 days ago

You Only Need Minimal RLVR Training: Extrapolating LLMs via Rank-1 Trajectories

upvoted a paper 16 days ago

Process Rewards with Learned Reliability

Organizations

None yet

models 0

None public yet

datasets 0

None public yet