Zhaolin Gao

GitBag

AI & ML interests

Reinforcement Learning from Human Feedback

Recent Activity

updated a dataset 7 minutes ago
GitBag/1744494293
updated a dataset 10 minutes ago
GitBag/1744494214
updated a dataset 14 minutes ago
GitBag/1744493997
View all activity

Organizations

Cornell-AGI's profile picture

Articles 1

Article
6

RLHF 101: A Technical Dive into RLHF