Liang's picture

Liang

Ren-Wei

AI & ML interests

None yet

Recent Activity

updated a model about 1 month ago

Ren-Wei/Safe-RLHF-PPO-Lag-baseline-opt-1b

updated a model about 1 month ago

Ren-Wei/Safe-RLHF-DPO-naive-baseline-opt-3b

updated a model about 1 month ago

Ren-Wei/Safe-RLHF-DPO-naive-baseline-opt-1b

View all activity

Organizations

None yet

models 18

Ren-Wei/Safe-RLHF-SFT-mist-7b

Updated 1 day ago

Ren-Wei/Safe-RLHF-PPO-Lag-baseline-opt-1b

Updated Dec 9, 2024 • 7

Ren-Wei/Safe-RLHF-DPO-naive-baseline-opt-3b

Updated Dec 9, 2024 • 6

Ren-Wei/Safe-RLHF-DPO-naive-baseline-opt-1b

Updated Dec 9, 2024 • 10

Ren-Wei/Safe-RLHF-PPO-helpless-opt-1b

Updated Dec 4, 2024 • 4

Ren-Wei/Safe-RLHF-PPO-harmless-opt-1b

Updated Dec 4, 2024 • 3

Ren-Wei/Safe-RLHF-DPO-helpless-opt-3b

Updated Dec 3, 2024 • 8

Ren-Wei/Safe-RLHF-DPO-helpful-opt-3b

Updated Dec 3, 2024 • 18

Ren-Wei/Safe-RLHF-DPO-harmless-opt-3b

Updated Dec 3, 2024 • 8

Ren-Wei/Safe-RLHF-DPO-harmful-opt-3b

Updated Dec 3, 2024 • 8

datasets

None public yet