Huaijie Wang's picture

2 1

Huaijie Wang

jwhj

·

AI & ML interests

None yet

Organizations

None yet

jwhj's activity

upvoted a paper 4 months ago

Offline Reinforcement Learning for LLM Multi-Step Reasoning

Paper • 2412.16145 • Published Dec 20, 2024 • 39