PKU-Alignment/beaver-7b-v1.0-cost
Tags: Reinforcement Learning, Safetensors, PKU-Alignment/PKU-SafeRLHF, English, safe-rlhf, llama, reinforcement-learning-from-human-feedback, beaver, safety, ai-safety, deepspeed, rlhf, alpaca, arxiv:2302.13971, arxiv:2307.04657, arxiv:2310.12773
Branch: main
2 contributors, History: 9 commits
Latest commit: XuehaiPan, "Update README.md" (c1bd343), 19 days ago
File                              Size       Storage  Last commit                              Updated
.gitattributes                    1.52 kB    -        initial commit                           10 months ago
README.md                         3.41 kB    -        Update README.md                         19 days ago
config.json                       795 Bytes  -        Convert model checkpoint to safetensors  20 days ago
model-00001-of-00007.safetensors  1.98 GB    LFS      Convert model checkpoint to safetensors  20 days ago
model-00002-of-00007.safetensors  1.99 GB    LFS      Convert model checkpoint to safetensors  20 days ago
model-00003-of-00007.safetensors  1.99 GB    LFS      Convert model checkpoint to safetensors  20 days ago
model-00004-of-00007.safetensors  1.99 GB    LFS      Convert model checkpoint to safetensors  20 days ago
model-00005-of-00007.safetensors  1.93 GB    LFS      Convert model checkpoint to safetensors  20 days ago
model-00006-of-00007.safetensors  1.93 GB    LFS      Convert model checkpoint to safetensors  20 days ago
model-00007-of-00007.safetensors  1.39 GB    LFS      Convert model checkpoint to safetensors  20 days ago
model.safetensors.index.json      24.2 kB    -        Convert model checkpoint to safetensors  20 days ago
special_tokens_map.json           549 Bytes  -        Convert model checkpoint to safetensors  20 days ago
tokenizer.json                    1.84 MB    -        Convert model checkpoint to safetensors  20 days ago
tokenizer_config.json             1.1 kB     -        Convert model checkpoint to safetensors  20 days ago