nash_dpo_rank4_iter_3 / config.json

Commit History

DPO-7b-beta0.01
d7192ff
verified

YYYYYYibo commited on