mistralit2_500_STEPS_1e8_rate_03_beta_DPO / model-00001-of-00003.safetensors

Commit History

End of training
116758d
verified

tsavage68 committed on