Pinkstack commited on
Commit
7b10191
·
verified ·
1 Parent(s): d3ed0cf

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +14 -12
README.md CHANGED
@@ -148,6 +148,20 @@ It is recommended to use a system prompt like this one:
148
  You are a helpful ai assistant. Make sure to put your finalanswer at the end.
149
  ```
150
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
151
  # 🧀 Examples:
152
  (q4_k_m, 10GB rtx 3080, 64GB memory, running inside of MSTY, all use "You are a friendly ai assistant." as the System prompt.)
153
  **example 1:**
@@ -175,17 +189,5 @@ All generated locally and pretty quickly too! 😲 Due to our very limited resou
175
  - **Finetuned from model :** microsoft/phi-4
176
 
177
  This phi-4 model was trained with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
178
- # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
179
- Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/Pinkstack__SuperThoughts-CoT-14B-16k-o1-QwQ-details)!
180
- Summarized results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/contents/viewer/default/train?q=Pinkstack%2FSuperThoughts-CoT-14B-16k-o1-QwQ&sort[column]=Average%20%E2%AC%86%EF%B8%8F&sort[direction]=desc)!
181
 
182
- | Metric |Value (%)|
183
- |-------------------|--------:|
184
- |**Average** | 31.17|
185
- |IFEval (0-Shot) | 5.15|
186
- |BBH (3-Shot) | 52.85|
187
- |MATH Lvl 5 (4-Shot)| 40.79|
188
- |GPQA (0-shot) | 19.02|
189
- |MuSR (0-shot) | 21.79|
190
- |MMLU-PRO (5-shot) | 47.43|
191
 
 
148
  You are a helpful ai assistant. Make sure to put your finalanswer at the end.
149
  ```
150
 
151
+ # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
152
+ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/Pinkstack__SuperThoughts-CoT-14B-16k-o1-QwQ-details)!
153
+ Summarized results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/contents/viewer/default/train?q=Pinkstack%2FSuperThoughts-CoT-14B-16k-o1-QwQ&sort[column]=Average%20%E2%AC%86%EF%B8%8F&sort[direction]=desc)!
154
+
155
+ | Metric |Value (%)|
156
+ |-------------------|--------:|
157
+ |**Average** | 31.17|
158
+ |IFEval (0-Shot) | 5.15|
159
+ |BBH (3-Shot) | 52.85|
160
+ |MATH Lvl 5 (4-Shot)| 40.79|
161
+ |GPQA (0-shot) | 19.02|
162
+ |MuSR (0-shot) | 21.79|
163
+ |MMLU-PRO (5-shot) | 47.43|
164
+
165
  # 🧀 Examples:
166
  (q4_k_m, 10GB rtx 3080, 64GB memory, running inside of MSTY, all use "You are a friendly ai assistant." as the System prompt.)
167
  **example 1:**
 
189
  - **Finetuned from model :** microsoft/phi-4
190
 
191
  This phi-4 model was trained with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
 
 
 
192
 
 
 
 
 
 
 
 
 
 
193