Edit model card

Fine-tuning on Intel Gaudi2

This model is a fine-tuned model based on mistralai/Mistral-7B-v0.1 on the open source dataset Open-Orca/SlimOrca. Then we align it with DPO algorithm. For more details, you can refer our blog: The Practice of Supervised Fine-tuning and Direct Preference Optimization on Intel Gaudi2.

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric Value
Avg. 65.52
AI2 Reasoning Challenge (25-Shot) 66.64
HellaSwag (10-Shot) 82.12
MMLU (5-Shot) 62.37
TruthfulQA (0-shot) 60.22
Winogrande (5-shot) 79.64
GSM8k (5-shot) 42.15
Downloads last month
3,449
Safetensors
Model size
10.7B params
Tensor type
FP16
·

Evaluation results