Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
ashercn97
/
reward-train-facebook-opt350m-with-hh-rlhf
like
0
PEFT
Safetensors
Generated from Trainer
License:
other
Model card
Files
Files and versions
Community
Use this model
283b0da
reward-train-facebook-opt350m-with-hh-rlhf
/
vocab.json
Commit History
reward-train-facebook-opt350m-with-hh-rlhf
e471f3f
verified
ashercn97
commited on
Jul 25