Adding Evaluation Results

#1
Files changed (1) hide show
  1. README.md +16 -3
README.md CHANGED
@@ -1,11 +1,11 @@
1
  ---
2
- library_name: transformers
3
  license: llama3.2
4
- base_model: unsloth/Llama-3.2-3B-Instruct
5
  tags:
6
  - llama-factory
7
  - freeze
8
  - generated_from_trainer
 
9
  model-index:
10
  - name: Llama-3.2-3B-Instruct-24-9-29
11
  results: []
@@ -76,4 +76,17 @@ The following hyperparameters were used during training:
76
  - Transformers 4.45.0
77
  - Pytorch 2.4.0+cu124
78
  - Datasets 2.19.1
79
- - Tokenizers 0.20.0
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
 
2
  license: llama3.2
3
+ library_name: transformers
4
  tags:
5
  - llama-factory
6
  - freeze
7
  - generated_from_trainer
8
+ base_model: unsloth/Llama-3.2-3B-Instruct
9
  model-index:
10
  - name: Llama-3.2-3B-Instruct-24-9-29
11
  results: []
 
76
  - Transformers 4.45.0
77
  - Pytorch 2.4.0+cu124
78
  - Datasets 2.19.1
79
+ - Tokenizers 0.20.0
80
+ # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
81
+ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_win10__Llama-3.2-3B-Instruct-24-9-29)
82
+
83
+ | Metric |Value|
84
+ |-------------------|----:|
85
+ |Avg. |23.70|
86
+ |IFEval (0-Shot) |73.32|
87
+ |BBH (3-Shot) |24.20|
88
+ |MATH Lvl 5 (4-Shot)|15.26|
89
+ |GPQA (0-shot) | 3.24|
90
+ |MuSR (0-shot) | 1.44|
91
+ |MMLU-PRO (5-shot) |24.76|
92
+