7b_dpo_iter1_4e7_bz32_step200_only_onpolicy / model-00007-of-00008.safetensors

Commit History

Upload GemmaForCausalLM
29d8af3
verified

1231czx commited on