Yhyu13/phi-2-sft-dpo-gpt4_en-ep1-GGUF

Quantized GGUF model files for phi-2-sft-dpo-gpt4_en-ep1 from Yhyu13

Original Model Card:

This is the merged model for the LoRA adapter https://huggingface.co/Yhyu13/phi-2-sft-dpo-gpt4_en-ep1-lora

This model is a DPO improvement over the base model https://huggingface.co/Yhyu13/phi-2-sft-alpaca_gpt4_en-ep1, which scores better than text-davinci-003 on AlpacaEval as judged by ChatGPT.

GGUF
Model size: 2.78B params
Architecture: phi2
Quantizations: 2-bit, 3-bit, 4-bit, 5-bit, 6-bit, 8-bit
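The parameter count and bit widths listed above give a rough sense of how large each quantized file is likely to be. The sketch below is only a back-of-the-envelope estimate: it assumes every weight is stored at exactly the nominal bit width, which real GGUF quantization formats do not do (they add per-block scales and metadata), so actual files should be somewhat larger. The function name `estimate_size_gb` is hypothetical and introduced only for illustration.

```python
# Rough lower-bound size estimate: parameters * bits per weight / 8 bytes.
# Assumption: every weight uses exactly the nominal bit width. Real GGUF
# files also store per-block scales and metadata, so they come out larger.

PARAMS = 2.78e9  # parameter count stated on the model card
BIT_WIDTHS = [2, 3, 4, 5, 6, 8]  # quantization levels listed on the card


def estimate_size_gb(params: float, bits: int) -> float:
    """Return the idealized weight size in gigabytes (1 GB = 1e9 bytes)."""
    return params * bits / 8 / 1e9


if __name__ == "__main__":
    for bits in BIT_WIDTHS:
        print(f"{bits}-bit: ~{estimate_size_gb(PARAMS, bits):.2f} GB")
```

Under these assumptions the 4-bit variant needs at least about 1.39 GB for weights alone and the 8-bit variant about 2.78 GB, which can help when choosing a file that fits in available RAM or VRAM.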

Model tree for afrideva/phi-2-sft-dpo-gpt4_en-ep1-GGUF: quantized (this model)