vanilla_dpo_iter_4 / trainer_state.json

Commit History

Model save
2cbbc30
verified

ShenaoZ commited on