arxiv:2410.01769
Zhenting Qi
zhenting
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
22 days ago
VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit
Assignment
Organizations
models
None public yet
datasets
None public yet