mistral_gsm8k_dpo / tokenizer_config.json

Commit History

Adapter trained with DPO on the gsm8k preference dataset for 1 epoch
f00f5bf
verified

valerielucro committed on