Safetensors
English
llama
alignment-handbook
trl
dpo
Generated from Trainer

Commit History

Update README.md
30e6682
verified

Zhangchen Xu commited on

Update README.md
9ebe5f7
verified

Zhangchen Xu commited on

Update README.md
88d17cd
verified

Zhangchen Xu commited on

End of training
f98f101
verified

Zhangchen Xu commited on

Model save
d8eec6f
verified

Zhangchen Xu commited on

Training in progress, step 765
cf8884e
verified

Zhangchen Xu commited on

Training in progress, step 700
037bbde
verified

Zhangchen Xu commited on

Training in progress, step 600
0fa5b18
verified

Zhangchen Xu commited on

Training in progress, step 500
0a183b3
verified

Zhangchen Xu commited on

Training in progress, step 400
b8c4d60
verified

Zhangchen Xu commited on

Training in progress, step 300
fa08f18
verified

Zhangchen Xu commited on

Training in progress, step 200
bf37a10
verified

Zhangchen Xu commited on

Training in progress, step 100
f92c0a2
verified

Zhangchen Xu commited on

initial commit
ba80c4f
verified

Zhangchen Xu commited on