Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
1
Aviral Kumar
aviralkumar
Follow
21world's profile picture
sabarirajan's profile picture
iamamanpandey's profile picture
3 followers
ยท
1 following
AI & ML interests
None yet
Recent Activity
authored
a paper
1 day ago
Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning
authored
a paper
6 months ago
Training Language Models to Self-Correct via Reinforcement Learning
authored
a paper
7 months ago
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters
View all activity
Organizations
None yet
Papers
7
arxiv:
2503.07572
arxiv:
2409.12917
arxiv:
2408.03314
arxiv:
2406.11896
Expand 7 papers
models
None public yet
datasets
None public yet