phi-2-gpo-newSFT-b0.001-i0

This model is a fine-tuned version of DUAL-GPO/phi-2-sft-lora-ultrachat-merged on the HuggingFaceH4/ultrafeedback_binarized dataset.

Downloads last month
11
Safetensors
Model size
2.78B params
Tensor type
BF16
·
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.

Model tree for DUAL-GPO/phi-2-gpo-new-i0

Adapters
43 models