mistral_gsm8k_resumed / adapter_model.safetensors

Commit History

adaptor with DPO onfull gsm8k preference dataset and 1 epoch
bfdf157
verified

valerielucro commited on