Weijing Huang's picture

3 8 40

Weijing Huang

waleking

AI & ML interests

Language Models

Recent Activity

upvoted a paper 14 days ago

VAPO: Efficient and Reliable Reinforcement Learning for Advanced Reasoning Tasks

liked a dataset about 2 months ago

OpenStellarTeam/Chinese-SimpleQA

liked a dataset about 2 months ago

allenai/olmOCR-mix-0225

View all activity

Organizations

None yet

waleking's activity

upvoted a paper 14 days ago

VAPO: Efficient and Reliable Reinforcement Learning for Advanced Reasoning Tasks

Paper • 2504.05118 • Published 17 days ago • 25

upvoted a paper 2 months ago

TransMLA: Multi-head Latent Attention Is All You Need

Paper • 2502.07864 • Published Feb 11 • 49

upvoted 2 papers 3 months ago

Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning

Paper • 2502.03275 • Published Feb 5 • 17

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published Jan 13 • 99