Model Card for Model ID

DPO finetuned version of phi3-instruct-4k on student annotated preference data focusing on course content questions from EPFL curriculum (physics, math, cs).

Downloads last month
0
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for cs552-mlp/phi3-dpo

Adapter
(21)
this model

Collection including cs552-mlp/phi3-dpo