Ranran zhen
zenRRan
AI & ML interests
None yet
Recent Activity
upvoted a paper about 9 hours ago
ToolRL: Reward is All Tool Learning Needs
upvoted a paper about 9 hours ago
OTC: Optimal Tool Calls via Reinforcement Learning
upvoted a paper 3 months ago
Test-time Computing: from System-1 Thinking to System-2 Thinking
Organizations
None yet
Models
None public yet
Datasets
None public yet