Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Liang
Ren-Wei
Follow
AI & ML interests
None yet
Recent Activity
updated
a model
about 1 month ago
Ren-Wei/Safe-RLHF-PPO-Lag-baseline-opt-1b
updated
a model
about 1 month ago
Ren-Wei/Safe-RLHF-DPO-naive-baseline-opt-3b
updated
a model
about 1 month ago
Ren-Wei/Safe-RLHF-DPO-naive-baseline-opt-1b
View all activity
Organizations
None yet
models
18
Sort: Recently updated
Ren-Wei/Safe-RLHF-SFT-mist-7b
Updated
1 day ago
Ren-Wei/Safe-RLHF-PPO-Lag-baseline-opt-1b
Updated
Dec 9, 2024
•
7
Ren-Wei/Safe-RLHF-DPO-naive-baseline-opt-3b
Updated
Dec 9, 2024
•
6
Ren-Wei/Safe-RLHF-DPO-naive-baseline-opt-1b
Updated
Dec 9, 2024
•
10
Ren-Wei/Safe-RLHF-PPO-helpless-opt-1b
Updated
Dec 4, 2024
•
4
Ren-Wei/Safe-RLHF-PPO-harmless-opt-1b
Updated
Dec 4, 2024
•
3
Ren-Wei/Safe-RLHF-DPO-helpless-opt-3b
Updated
Dec 3, 2024
•
8
Ren-Wei/Safe-RLHF-DPO-helpful-opt-3b
Updated
Dec 3, 2024
•
18
Ren-Wei/Safe-RLHF-DPO-harmless-opt-3b
Updated
Dec 3, 2024
•
8
Ren-Wei/Safe-RLHF-DPO-harmful-opt-3b
Updated
Dec 3, 2024
•
8
Expand 18 models
datasets
None public yet