MedQA_L3_1000steps_1e5rate_03beta_CSFTDPO / model-00001-of-00004.safetensors

Commit History

End of training
b784446
verified

tsavage68 committed on