Zaiyan Xu

diligentotter

AI & ML interests

None yet

Organizations

None yet

diligentotter's activity

upvoted 2 articles about 1 month ago

Illustrating Reinforcement Learning from Human Feedback (RLHF)

28

Preference Tuning LLMs with Direct Preference Optimization Methods

18