Directional Preference Alignment

classroom

AI & ML interests

None defined yet.

Recent Activity

weqweasdas authored a paper 7 months ago

RLHF Workflow: From Reward Modeling to Online RLHF

weqweasdas authored a paper 7 months ago

Iterative Preference Learning from Human Feedback: Bridging Theory and Practice for RLHF under KL-Constraint

weqweasdas authored a paper 7 months ago

LMFlow: An Extensible Toolkit for Finetuning and Inference of Large Foundation Models

View all activity

directional-preference-alignment's activity

weqweasdas

authored 4 papers 7 months ago

RLHF Workflow: From Reward Modeling to Online RLHF

Paper • 2405.07863 • Published May 13 • 66

Iterative Preference Learning from Human Feedback: Bridging Theory and Practice for RLHF under KL-Constraint

Paper • 2312.11456 • Published Dec 18, 2023 • 1

LMFlow: An Extensible Toolkit for Finetuning and Inference of Large Foundation Models

Paper • 2306.12420 • Published Jun 21, 2023 • 2

RAFT: Reward rAnked FineTuning for Generative Foundation Model Alignment

Paper • 2304.06767 • Published Apr 13, 2023 • 2

Haoxiang-Wang

authored a paper 7 months ago

RLHF Workflow: From Reward Modeling to Online RLHF

Paper • 2405.07863 • Published May 13 • 66

Haoxiang-Wang

authored a paper about 1 year ago

SAM-CLIP: Merging Vision Foundation Models towards Semantic and Spatial Understanding

Paper • 2310.15308 • Published Oct 23, 2023 • 22