zpysky1125's picture

3 1

zpysky1125

pyzhao

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 17 days ago

Exploring Data Scaling Trends and Effects in Reinforcement Learning from Human Feedback

liked a model 3 months ago

MiniMaxAI/MiniMax-Text-01

upvoted a paper 3 months ago

REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

View all activity

Organizations

pyzhao's activity

upvoted a paper 17 days ago

Exploring Data Scaling Trends and Effects in Reinforcement Learning from Human Feedback

Paper • 2503.22230 • Published 20 days ago • 43

liked a model 3 months ago

MiniMaxAI/MiniMax-Text-01

Text Generation • Updated 7 days ago • 5.57k • 570

upvoted a paper 3 months ago

REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

Paper • 2501.03262 • Published Jan 4 • 99

upvoted an article 11 months ago

Article

Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models

Mar 20, 2024

• 85