MedQA_L3_300steps_1e7rate_05beta_CSFTDPO / model-00002-of-00004.safetensors

Commit History

End of training
7850793
verified

tsavage68 committed on