Phi-2 Super (SFT + cDPO)


Description

This repo contains 4-bit Marlin-format model files for abacaj's Phi-2 Super.

Phi-2-super (SFT + cDPO)

Base Model: microsoft/phi-2

Chat template

The model uses the same chat template as found in Mistral instruct models:

text = (
    "<|endoftext|>[INST] What is your favourite condiment? [/INST]"
    "Well, I'm quite partial to a good squeeze of fresh lemon juice. It adds just the right amount of zesty flavour to whatever I'm cooking up in the kitchen!<|endoftext|> "
    "[INST] Do you have mayonnaise recipes? [/INST]"
)

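The layout in the template example can be produced programmatically. The following is a minimal sketch, assuming the format is exactly what that example shows: a leading `<|endoftext|>`, each user turn wrapped in `[INST] ... [/INST]`, and each assistant reply followed by `<|endoftext|>` and a space. The helper name `build_prompt` and the `turns` structure are hypothetical and are not part of the model repo or of any library.

```python
from typing import List, Optional, Tuple

# Special token that both opens the prompt and closes each assistant reply
# in the template example.
EOT = "<|endoftext|>"


def build_prompt(turns: List[Tuple[str, Optional[str]]]) -> str:
    """Build a Mistral-instruct-style prompt string.

    `turns` is a list of (user_message, assistant_reply) pairs. Use None as
    the reply of the final pair to leave the prompt open for generation.
    This is a hypothetical helper that mirrors the template example only.
    """
    parts = [EOT]
    for user_message, assistant_reply in turns:
        parts.append(f"[INST] {user_message} [/INST]")
        if assistant_reply is not None:
            # A completed assistant turn ends with the token plus one space.
            parts.append(f"{assistant_reply}{EOT} ")
    return "".join(parts)


prompt = build_prompt([
    ("What is your favourite condiment?", "I like lemon juice."),
    ("Do you have mayonnaise recipes?", None),
])
print(prompt)
```

If the tokenizer shipped with the model files defines its own chat template, that template is the better source of truth; this sketch only reproduces the string layout by hand.
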
MT-bench / heval

[Figures: MT-bench and heval results]

Safetensors model size: 597M params. Tensor types: I32, FP16.