nash_dpo_rank4_iter_3 / train_results.json

Commit History

Model save
73a7e32

YYYYYYibo committed on