peft adapter with SFT on gsm8k preference dataset and 1 epoch
f7893ed (verified), valerielucro committed on Jun 19