MedQA_L3_250steps_1e7rate_05beta_CSFTDPO / model-00003-of-00004.safetensors

Commit History

End of training
c911926
verified

tsavage68 commited on