Commit History

adapter trained with DPO on the gsm8k preference dataset with cot and 1 epoch
9c07365
verified

valerielucro commited on

adapter trained with DPO on the gsm8k preference dataset with cot and 1 epoch
cfeb3fa
verified

valerielucro commited on