Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
treasure4l
/
Llama3.2-Instruct-DPO
like
0
Safetensors
trl-lib/ultrafeedback_binarized
arxiv:
1910.09700
Model card
Files
Files and versions
Community
main
Llama3.2-Instruct-DPO
Commit History
Update README.md
64b174c
verified
treasure4l
commited on
Jan 14
Upload 6 files
fb37779
verified
treasure4l
commited on
Jan 14
Create README.md
fb474f6
verified
treasure4l
commited on
Jan 14
initial commit
da84868
verified
treasure4l
commited on
Jan 14