Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Xiaodong
/
Next-DPO-iter2
like
0
Safetensors
Xiaodong/DPO-iter2-data-8k
Model card
Files
Files and versions
Community
b40531c
Next-DPO-iter2
/
checkpoint-500
/
rng_state_2.pth
Commit History
upload ckpt
d4e8c62
Wang-Xiaodong1899
commited on
Oct 13