Zaiyan Xu's picture

2

Zaiyan Xu

diligentotter

·

https://www.zaiyanxu.com

zaiyan-x

AI & ML interests

None yet

Organizations

None yet

diligentotter's activity

upvoted 2 articles 10 months ago

Article

Illustrating Reinforcement Learning from Human Feedback (RLHF)

Dec 9, 2022

• 157

Article

Preference Tuning LLMs with Direct Preference Optimization Methods

Jan 18, 2024

• 45