Aviral Kumar's picture

1

Aviral Kumar

aviralkumar

·

AI & ML interests

None yet

Recent Activity

authored a paper 1 day ago

Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning

authored a paper 6 months ago

Training Language Models to Self-Correct via Reinforcement Learning

authored a paper 7 months ago

Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters

View all activity

Organizations

None yet

Papers 7

arxiv:2503.07572

arxiv:2409.12917

arxiv:2408.03314

arxiv:2406.11896

models

None public yet

datasets

None public yet