Edit model card

Model Card for Model ID

Just testing out LLM Finetuning. Finetuned on upstage/SOLAR-10.7B-Instruct-v1.0 using argilla/distilabel-intel-orca-dpo-pairs. Followed the Google Colab mentioned in this article: https://towardsdatascience.com/fine-tune-a-mistral-7b-model-with-direct-preference-optimization-708042745aac

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric Value
Avg. 74.08
AI2 Reasoning Challenge (25-Shot) 71.25
HellaSwag (10-Shot) 88.34
MMLU (5-Shot) 66.04
TruthfulQA (0-shot) 71.36
Winogrande (5-shot) 83.19
GSM8k (5-shot) 64.29
Downloads last month
2,422
Safetensors
Model size
10.7B params
Tensor type
FP16
·

Finetuned from

Dataset used to train dhanushreddy29/BrokenKeyboard

Evaluation results