arxiv:2510.20733
Zhuokai Zhao
zhuokai
AI & ML interests
Data-Efficient Learning, LLM Reasoning and Safety, Active Learning, Recommender System
Recent Activity
authored
a paper
7 days ago
Beyond Reward Hacking: Causal Rewards for Large Language Model Alignment
authored
a paper
7 days ago
Transfer between Modalities with MetaQueries