Xiaodong/Next-DPO-iter2
Safetensors
Xiaodong/DPO-iter2-data-8k
b40531c
Next-DPO-iter2/checkpoint-500/latest
Wang-Xiaodong1899: upload ckpt (d4e8c62, 3 months ago)
14 Bytes
global_step500