Rui Yang

Ray2333
AI & ML interests

Deep Reinforcement Learning

Organizations

None yet

Ray2333's activity

New activity in Ray2333/gpt2-large-harmless-reward_model 3 months ago

How to train the model

#1 opened 3 months ago by mike2000