ymh233

AI & ML interests

None yet

ymh233's activity

upvoted a paper 14 days ago

COIG-P: A High-Quality and Large-Scale Chinese Preference Dataset for Alignment with Human Values

Paper • 2504.05535 • Published 15 days ago • 44

upvoted a paper 29 days ago

SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild

Paper • 2503.18892 • Published 29 days ago • 30

upvoted a paper about 2 months ago

Process-based Self-Rewarding Language Models

Paper • 2503.03746 • Published Mar 5 • 40

upvoted a paper 3 months ago

Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models

Paper • 2501.13629 • Published Jan 23 • 48