mistralit2_1000_STEPS_1e8_rate_0.1_beta_DPO / model-00001-of-00003.safetensors

Commit History

End of training
34c0a5c
verified

tsavage68 commited on