mistral_gsm8k_dpo / tokenizer.json
valerielucro's picture
adapter trained with DPO on the gsm8k preference dataset and 1 epoch
f00f5bf verified
raw
history
No virus
1.8 MB
File too large to display, you can check the raw version instead.