15 23 16

Yuzhen Huang

yuzhen17

https://hyz17.github.io

HYZ17

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

Video-R1: Reinforcing Video Reasoning in MLLMs

upvoted a paper 5 days ago

Expanding RL with Verifiable Rewards Across Diverse Domains

upvoted a paper 6 days ago

Exploring Data Scaling Trends and Effects in Reinforcement Learning from Human Feedback

View all activity

Organizations

yuzhen17's activity

upvoted a paper 4 days ago

Video-R1: Reinforcing Video Reasoning in MLLMs

Paper • 2503.21776 • Published 10 days ago • 75

upvoted a paper 5 days ago

Expanding RL with Verifiable Rewards Across Diverse Domains

Paper • 2503.23829 • Published 7 days ago • 17

upvoted a paper 6 days ago

Exploring Data Scaling Trends and Effects in Reinforcement Learning from Human Feedback

Paper • 2503.22230 • Published 10 days ago • 43

upvoted 2 papers 13 days ago

SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild

Paper • 2503.18892 • Published 13 days ago • 28

Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't

Paper • 2503.16219 • Published 17 days ago • 46

upvoted 2 papers about 1 month ago

Predictive Data Selection: The Data That Predicts Is the Data That Teaches

Paper • 2503.00808 • Published Mar 2 • 57

MLGym: A New Framework and Benchmark for Advancing AI Research Agents

Paper • 2502.14499 • Published Feb 20 • 190

upvoted a paper about 2 months ago

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

Paper • 2502.11089 • Published Feb 16 • 151

upvoted 3 papers 3 months ago

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published Jan 13 • 99

B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners

Paper • 2412.17256 • Published Dec 23, 2024 • 48

Diving into Self-Evolving Training for Multimodal Reasoning

Paper • 2412.17451 • Published Dec 23, 2024 • 44

upvoted a paper 4 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 364

upvoted a paper 5 months ago

Generative World Explorer

Paper • 2411.11844 • Published Nov 18, 2024 • 77

upvoted 2 papers 7 months ago

MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans?

Paper • 2408.13257 • Published Aug 23, 2024 • 26

Learning to Move Like Professional Counter-Strike Players

Paper • 2408.13934 • Published Aug 25, 2024 • 23

upvoted 3 papers 8 months ago

Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model

Paper • 2408.11039 • Published Aug 20, 2024 • 60

xGen-MM (BLIP-3): A Family of Open Large Multimodal Models

Paper • 2408.08872 • Published Aug 16, 2024 • 100

OpenDevin: An Open Platform for AI Software Developers as Generalist Agents

Paper • 2407.16741 • Published Jul 23, 2024 • 71

upvoted a paper 9 months ago

Qwen2 Technical Report

Paper • 2407.10671 • Published Jul 15, 2024 • 162