phi3-dpo / README.md
ludekcizinsky's picture
Update README.md
ca59bdd verified
|
raw
history blame
267 Bytes
metadata
library_name: peft
base_model: unsloth/Phi-3-mini-4k-instruct-bnb-4bit

Model Card for Model ID

DPO finetuned version of phi3-instruct-4k on student annotated preference data focusing on course content questions from EPFL curriculum (physics, math, cs).