wenlong deng's picture

wenlong deng

dwenlong

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 5 days ago

Directional Alignment Mitigates Reward Hacking in Reinforcement Learning for Language Models

submitted a paper 5 days ago

Directional Alignment Mitigates Reward Hacking in Reinforcement Learning for Language Models

upvoted a paper 24 days ago

Privileged Information Distillation for Language Models

View all activity

Organizations

dwenlong 's papers 6

arxiv:2603.12634

arxiv:2602.00344

arxiv:2512.04220

arxiv:2510.03669

arxiv:2504.00993

arxiv:2410.09344