Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
vangard703
/
DPO-PairRM-5-SMI-lr-1e6-iteration-5-t-7e-beta-15e3-2-iteration-6e1-confidence
like
0
Model card
Files
Files and versions
Community
main
DPO-PairRM-5-SMI-lr-1e6-iteration-5-t-7e-beta-15e3-2-iteration-6e1-confidence
Commit History
initial commit
29e8ec0
verified
vangard703
commited on
Apr 24