
Model Card for cs552-mlp/phi3-dpo

A DPO (Direct Preference Optimization) fine-tuned version of phi3-instruct-4k, trained on student-annotated preference data covering course-content questions from the EPFL curriculum (physics, math, and computer science).
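
For readers unfamiliar with DPO, the per-example objective can be sketched as follows. This is a minimal illustration of the standard DPO loss, not the training code used for this model: the function name `dpo_loss`, the log-probability inputs, and the value `beta=0.1` are assumptions chosen for the example, since the card does not state the training hyperparameters.

```python
import math


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Standard DPO loss for one preference pair (hypothetical helper).

    Each argument is the summed log-probability of a full response under
    either the policy being trained or the frozen reference model.
    beta=0.1 is an assumed value, not taken from the model card.
    """
    # How much more the policy favors each response than the reference does.
    chosen_logratio = policy_chosen_logp - ref_chosen_logp
    rejected_logratio = policy_rejected_logp - ref_rejected_logp
    margin = beta * (chosen_logratio - rejected_logratio)
    # -log(sigmoid(margin)), computed in a numerically stable way.
    if margin > 0:
        return math.log1p(math.exp(-margin))
    return -margin + math.log1p(math.exp(margin))
```

The loss equals log 2 when the policy and the reference agree, and it decreases as the policy assigns relatively more probability to the preferred (chosen) answer than the reference does.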

