Files changed (1) hide show
  1. README.md +15 -1
README.md CHANGED
@@ -1,8 +1,8 @@
1
  ---
2
  license: mit
3
- base_model: microsoft/phi-2
4
  tags:
5
  - generated_from_trainer
 
6
  model-index:
7
  - name: phi-sft-out
8
  results: []
@@ -149,3 +149,17 @@ The following hyperparameters were used during training:
149
  - Pytorch 2.1.2+cu121
150
  - Datasets 2.16.1
151
  - Tokenizers 0.15.0
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  license: mit
 
3
  tags:
4
  - generated_from_trainer
5
+ base_model: microsoft/phi-2
6
  model-index:
7
  - name: phi-sft-out
8
  results: []
 
149
  - Pytorch 2.1.2+cu121
150
  - Datasets 2.16.1
151
  - Tokenizers 0.15.0
152
+
153
+ # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
154
+ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_NovoCode__Phi-2-DPO)
155
+
156
+ | Metric |Value|
157
+ |---------------------------------|----:|
158
+ |Avg. |61.25|
159
+ |AI2 Reasoning Challenge (25-Shot)|60.75|
160
+ |HellaSwag (10-Shot) |75.03|
161
+ |MMLU (5-Shot) |57.75|
162
+ |TruthfulQA (0-shot) |44.46|
163
+ |Winogrande (5-shot) |73.64|
164
+ |GSM8k (5-shot) |55.88|
165
+