
Yhyu13/phi-2-sft-dpo-gpt4_en-ep1-GGUF

Quantized GGUF model files for phi-2-sft-dpo-gpt4_en-ep1 from Yhyu13

Original Model Card:

This is the merged model for LoRA https://huggingface.co/Yhyu13/phi-2-sft-dpo-gpt4_en-ep1-lora

This model is a DPO improvement over the base model https://huggingface.co/Yhyu13/phi-2-sft-alpaca_gpt4_en-ep1, which achieves better results than text-davinci-003 on AlpacaEval, as judged by ChatGPT.

GGUF
Model size: 2.78B params
Architecture: phi2
