valerielucro's picture
adapter trained with DPO on the gsm8k preference dataset with cot and 1 epoch
6b24dac verified
raw
history contribute delete
No virus
1.8 MB
File too large to display, you can check the raw version instead.