Model Card for Model ID

microsoft/Phi-3-medium-4k-instruct trained with ORPO trainer.

Training Details

Training Data

mlabonne/orpo-dpo-mix-40k is used for finetuning this model.

[More Information Needed]

Training Procedure

Trained with ORPO trainer, and only first 5K rows are used for finetuning (5K out of 40K).

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric Value
Avg. 26.84
IFEval (0-Shot) 40.22
BBH (3-Shot) 46.63
MATH Lvl 5 (4-Shot) 16.69
GPQA (0-shot) 7.38
MuSR (0-shot) 10.53
MMLU-PRO (5-shot) 39.60
Downloads last month
23
Safetensors
Model size
14B params
Tensor type
BF16
·
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Model tree for BlackBeenie/Neos-Phi-3-14B-v0.1

Finetuned
(5)
this model
Quantizations
2 models

Dataset used to train BlackBeenie/Neos-Phi-3-14B-v0.1

Evaluation results