Hanchi Sun

MasterGodzilla

AI & ML interests

None yet

Recent Activity

published a model about 2 months ago

MasterGodzilla/Qwen2.5-0.5B-Open-R1-GRPO

published a model about 2 months ago

MasterGodzilla/Qwen2.5-1.5B-Open-R1-GRPO

upvoted a paper 5 months ago

HelpSteer2-Preference: Complementing Ratings with Preferences

Organizations

None yet

MasterGodzilla's activity

New activity in HuggingFaceH4/zephyr-7b-beta over 1 year ago

Why not use the Plackett-Luce Model version of DPO when K=4 ranked responses are present?

#18 opened over 1 year ago by
