yuqi yang's picture

2

yuqi yang

tzteyang

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

Agent models: Internalizing Chain-of-Action Generation into Reasoning models

authored a paper 3 days ago

Agent models: Internalizing Chain-of-Action Generation into Reasoning models

upvoted a paper 2 months ago

REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

View all activity

Organizations

None yet

tzteyang's activity

upvoted a paper 1 day ago

Agent models: Internalizing Chain-of-Action Generation into Reasoning models

Paper • 2503.06580 • Published 5 days ago • 14

authored a paper 3 days ago

Agent models: Internalizing Chain-of-Action Generation into Reasoning models

Paper • 2503.06580 • Published 5 days ago • 14

upvoted a paper 2 months ago

REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

Paper • 2501.03262 • Published Jan 4 • 92

authored a paper 3 months ago

o1-Coder: an o1 Replication for Coding

Paper • 2412.00154 • Published Nov 29, 2024 • 44