phi-3.1-qlora-blend-dpo
phi-3.1-qlora-blend-dpo is a merge of the following models using mergekit:
🧩 Configuration
models:
- model: microsoft/Phi-3-mini-4k-instruct
# no parameters necessary for base model
- model: ndavidson/phi-3.1-qlorablend-1
parameters:
density: 0.5
weight: 0.5
- model: ndavidson/phi-3.1-qlorablend-0
parameters:
density: 0.5
weight: 0.5
merge_method: ties
base_model: microsoft/Phi-3-mini-4k-instruct
parameters:
normalize: true
dtype: float16
- Downloads last month
- 7
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social
visibility and check back later, or deploy to Inference Endpoints (dedicated)
instead.