MedQA_L3_450steps_1e7rate_03beta_CSFTDPO / model-00004-of-00004.safetensors

Commit History

End of training
e16cf98

tsavage68 committed on