PKU-Alignment
university
https://github.com/PKU-Alignment
AI & ML interests
Reinforcement Learning, Large Language Models, Value Alignment
Team members: 8
Models: 15
PKU-Alignment/alpaca-8b-reproduced-llama-3 • Updated 20 days ago • 120
PKU-Alignment/alpaca-7b-reproduced-llama-2 • Updated 20 days ago • 324
PKU-Alignment/beaver-7b-v3.0 • Reinforcement Learning • Updated 20 days ago • 168
PKU-Alignment/beaver-7b-v2.0 • Reinforcement Learning • Updated 20 days ago • 262
PKU-Alignment/beaver-7b-v1.0 • Reinforcement Learning • Updated 20 days ago • 689 • 7
PKU-Alignment/alpaca-7b-reproduced • Updated 20 days ago • 16.2k • 2
PKU-Alignment/beaver-7b-unified-reward • Reinforcement Learning • Updated Apr 20 • 1.15k
PKU-Alignment/beaver-7b-unified-cost • Reinforcement Learning • Updated Apr 20 • 387
PKU-Alignment/beaver-7b-v3.0-reward • Reinforcement Learning • Updated Apr 20 • 2.65k
PKU-Alignment/beaver-7b-v3.0-cost • Reinforcement Learning • Updated Apr 20 • 2.05k
Datasets: 7
PKU-Alignment/processed-hh-rlhf • Viewer • Updated Nov 24, 2023 • 5.69k • 8
PKU-Alignment/PKU-SafeRLHF • Viewer • Updated Nov 20, 2023 • 119k • 74
PKU-Alignment/PKU-SafeRLHF-30K • Viewer • Updated Nov 20, 2023 • 4.71k • 3
PKU-Alignment/BeaverTails • Viewer • Updated Oct 17, 2023 • 13.8k • 26
PKU-Alignment/BeaverTails-single-dimension-preference • Viewer • Updated Aug 18, 2023 • 26
PKU-Alignment/PKU-SafeRLHF-10K • Viewer • Updated Jul 20, 2023 • 890 • 56
PKU-Alignment/BeaverTails-Evaluation • Viewer • Updated Jul 20, 2023 • 273 • 3