MaziyarPanahi commited on
Commit
9fe1102
1 Parent(s): 46f39db

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +11 -15
README.md CHANGED
@@ -135,9 +135,18 @@ This model is an advanced iteration of the powerful Qwen/Qwen2.5-3B, fine-tuned
135
  All GGUF models are available here: [MaziyarPanahi/calme-3.2-baguette-3b-GGUF](https://huggingface.co/MaziyarPanahi/calme-3.2-baguette-3b-GGUF)
136
 
137
 
138
- # 🏆 [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
 
139
 
140
- Leaderboard 2 coming soon!
 
 
 
 
 
 
 
 
141
 
142
 
143
  # Prompt Template
@@ -184,16 +193,3 @@ model = AutoModelForCausalLM.from_pretrained("MaziyarPanahi/calme-3.2-baguette-3
184
  # Ethical Considerations
185
 
186
  As with any large language model, users should be aware of potential biases and limitations. We recommend implementing appropriate safeguards and human oversight when deploying this model in production environments.
187
- # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
188
- Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_MaziyarPanahi__calme-3.2-baguette-3b)
189
-
190
- | Metric |Value|
191
- |-------------------|----:|
192
- |Avg. |22.14|
193
- |IFEval (0-Shot) |63.38|
194
- |BBH (3-Shot) |25.87|
195
- |MATH Lvl 5 (4-Shot)| 3.10|
196
- |GPQA (0-shot) | 5.93|
197
- |MuSR (0-shot) | 8.60|
198
- |MMLU-PRO (5-shot) |25.98|
199
-
 
135
  All GGUF models are available here: [MaziyarPanahi/calme-3.2-baguette-3b-GGUF](https://huggingface.co/MaziyarPanahi/calme-3.2-baguette-3b-GGUF)
136
 
137
 
138
+ # 🏆 [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
139
+ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_MaziyarPanahi__calme-3.2-baguette-3b)
140
 
141
+ | Metric |Value|
142
+ |-------------------|----:|
143
+ |Avg. |22.14|
144
+ |IFEval (0-Shot) |63.38|
145
+ |BBH (3-Shot) |25.87|
146
+ |MATH Lvl 5 (4-Shot)| 3.10|
147
+ |GPQA (0-shot) | 5.93|
148
+ |MuSR (0-shot) | 8.60|
149
+ |MMLU-PRO (5-shot) |25.98|
150
 
151
 
152
  # Prompt Template
 
193
  # Ethical Considerations
194
 
195
  As with any large language model, users should be aware of potential biases and limitations. We recommend implementing appropriate safeguards and human oversight when deploying this model in production environments.