Files changed (1) hide show
  1. README.md +49 -35
README.md CHANGED
@@ -1,35 +1,49 @@
1
- ---
2
- license: other
3
- library_name: transformers
4
- tags:
5
- - generated_from_trainer
6
- base_model: Qwen/Qwen2.5-3B
7
- license_name: qwen-research
8
- license_link: https://huggingface.co/Qwen/Qwen2.5-3B-Instruct/blob/main/LICENSE
9
- model-index:
10
- - name: outputs/gelato-3b
11
- results: []
12
- ---
13
-
14
- <!-- This model card has been generated automatically according to the information the Trainer had access to. You
15
- should probably proofread and complete it, then remove this comment. -->
16
-
17
- Prompt Format: **ChatML**
18
-
19
- This is an experimental which was heavily optimized for reasoning tasks and not meant for production-use.
20
-
21
- GGUFs: https://huggingface.co/mradermacher/raspberry-3B-GGUF
22
-
23
- # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
24
- Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_arcee-ai__raspberry-3B)
25
-
26
- | Metric |Value|
27
- |-------------------|----:|
28
- |Avg. |15.40|
29
- |IFEval (0-Shot) |31.54|
30
- |BBH (3-Shot) |19.53|
31
- |MATH Lvl 5 (4-Shot)| 7.63|
32
- |GPQA (0-shot) | 3.69|
33
- |MuSR (0-shot) | 9.41|
34
- |MMLU-PRO (5-shot) |20.60|
35
-
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: other
3
+ library_name: transformers
4
+ tags:
5
+ - generated_from_trainer
6
+ base_model: Qwen/Qwen2.5-3B
7
+ license_name: qwen-research
8
+ license_link: https://huggingface.co/Qwen/Qwen2.5-3B-Instruct/blob/main/LICENSE
9
+ language:
10
+ - zho
11
+ - eng
12
+ - fra
13
+ - spa
14
+ - por
15
+ - deu
16
+ - ita
17
+ - rus
18
+ - jpn
19
+ - kor
20
+ - vie
21
+ - tha
22
+ - ara
23
+ model-index:
24
+ - name: outputs/gelato-3b
25
+ results: []
26
+ ---
27
+
28
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
29
+ should probably proofread and complete it, then remove this comment. -->
30
+
31
+ Prompt Format: **ChatML**
32
+
33
+ This is an experimental which was heavily optimized for reasoning tasks and not meant for production-use.
34
+
35
+ GGUFs: https://huggingface.co/mradermacher/raspberry-3B-GGUF
36
+
37
+ # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
38
+ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_arcee-ai__raspberry-3B)
39
+
40
+ | Metric |Value|
41
+ |-------------------|----:|
42
+ |Avg. |15.40|
43
+ |IFEval (0-Shot) |31.54|
44
+ |BBH (3-Shot) |19.53|
45
+ |MATH Lvl 5 (4-Shot)| 7.63|
46
+ |GPQA (0-shot) | 3.69|
47
+ |MuSR (0-shot) | 9.41|
48
+ |MMLU-PRO (5-shot) |20.60|
49
+