mistral_gsm8k_dpo_cot_beta_0.9 / adapter_config.json

Commit History

adapter trained with DPO on the gsm8k preference dataset with cot and 1 epoch
487d117

valerielucro committed on