mistral_gsm8k_sft_and_dpo_4_beta_5 / adapter_model.safetensors

Commit History

initial adapter with SFT+DPO on gsm8k preference dataset and 1 epoch
649d127
verified

valerielucro commited on