Aviral Kumar
aviralkumar
AI & ML interests
None yet
Recent Activity
authored
a paper
2 days ago
Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning
authored
a paper
6 months ago
Training Language Models to Self-Correct via Reinforcement Learning
authored
a paper
7 months ago
Scaling LLM Test-Time Compute Optimally can be More Effective than
Scaling Model Parameters
Organizations
None yet
aviralkumar's activity
No public activity