PKU-Alignment
university
https://github.com/PKU-Alignment
AI & ML interests
Reinforcement Learning, Large Language Models, Value Alignment
Team members: 6
Models: 13
PKU-Alignment/beaver-7b-v3.0 • Reinforcement Learning • Updated 12 days ago • 65
PKU-Alignment/beaver-7b-v2.0 • Reinforcement Learning • Updated 12 days ago • 13
PKU-Alignment/beaver-7b-v1.0 • Reinforcement Learning • Updated 12 days ago • 193 • 7
PKU-Alignment/alpaca-7b-reproduced • Updated 12 days ago • 7.98k • 2
PKU-Alignment/beaver-7b-unified-reward • Reinforcement Learning • Updated 12 days ago • 212
PKU-Alignment/beaver-7b-unified-cost • Reinforcement Learning • Updated 12 days ago • 209
PKU-Alignment/beaver-7b-v3.0-reward • Reinforcement Learning • Updated 12 days ago • 805
PKU-Alignment/beaver-7b-v3.0-cost • Reinforcement Learning • Updated 12 days ago • 733
PKU-Alignment/beaver-7b-v2.0-reward • Reinforcement Learning • Updated 12 days ago • 1
PKU-Alignment/beaver-7b-v2.0-cost • Reinforcement Learning • Updated 12 days ago • 1
Datasets: 7
PKU-Alignment/processed-hh-rlhf • Updated Nov 24, 2023 • 5.5k • 8
PKU-Alignment/PKU-SafeRLHF • Updated Nov 20, 2023 • 16.5k • 66
PKU-Alignment/PKU-SafeRLHF-30K • Updated Nov 20, 2023 • 2.18k • 3
PKU-Alignment/BeaverTails • Updated Oct 17, 2023 • 14.7k • 20
PKU-Alignment/BeaverTails-single-dimension-preference • Updated Aug 18, 2023 • 59
PKU-Alignment/PKU-SafeRLHF-10K • Updated Jul 20, 2023 • 7.27k • 56
PKU-Alignment/BeaverTails-Evaluation • Updated Jul 20, 2023 • 231 • 3