This is a merge of pre-trained language models created using mergekit.
It combines the Llama 3.2 3B model with a LoRA adapter trained on reasoning datasets.
This model was merged using the Passthrough merge method, with cognitivecomputations/Dolphin3.0-Llama3.2-3B + bunnycore/Llama-3.2-3B-R1-lora as the base.
No additional models were blended in; the passthrough merge simply applies the LoRA adapter to the base model, roughly as in the sketch below.
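For illustration only, a minimal sketch of what the base + LoRA combination amounts to, using the peft library. The repo ids come from the merge configuration; the actual merge was produced with mergekit, so loading the published merged weights makes this step unnecessary.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

# Load the Dolphin 3.0 Llama-3.2-3B base model in bfloat16.
base = AutoModelForCausalLM.from_pretrained(
    "cognitivecomputations/Dolphin3.0-Llama3.2-3B",
    torch_dtype=torch.bfloat16,
)

# Apply the reasoning LoRA on top of the base weights, then fold the
# adapter into the base (the effect of the base+LoRA passthrough).
model = PeftModel.from_pretrained(base, "bunnycore/Llama-3.2-3B-R1-lora")
model = model.merge_and_unload()

# Tokenizer is taken from the base model, matching tokenizer_source below.
tokenizer = AutoTokenizer.from_pretrained("cognitivecomputations/Dolphin3.0-Llama3.2-3B")
```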
The following YAML configuration was used to produce this model:
```yaml
base_model: cognitivecomputations/Dolphin3.0-Llama3.2-3B+bunnycore/Llama-3.2-3B-R1-lora
dtype: bfloat16
merge_method: passthrough
models:
  - model: cognitivecomputations/Dolphin3.0-Llama3.2-3B+bunnycore/Llama-3.2-3B-R1-lora
tokenizer_source: cognitivecomputations/Dolphin3.0-Llama3.2-3B
```
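A minimal usage sketch for the merged model, assuming the weights are published as a Hugging Face repo. The repo id below is a placeholder; substitute this model's actual id.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Placeholder repo id for the merged model; replace with the real one.
repo_id = "your-username/llama-3.2-3b-dolphin-r1-merge"

tokenizer = AutoTokenizer.from_pretrained(repo_id)
model = AutoModelForCausalLM.from_pretrained(
    repo_id,
    torch_dtype=torch.bfloat16,
    device_map="auto",
)

# Chat template comes from the Dolphin tokenizer (see tokenizer_source above).
messages = [{"role": "user", "content": "Solve step by step: what is 17 * 24?"}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

outputs = model.generate(inputs, max_new_tokens=256)
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```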
Detailed results can be found here:

| Metric              | Value |
|---------------------|------:|
| Avg.                | 13.63 |
| IFEval (0-Shot)     | 43.52 |
| BBH (3-Shot)        | 12.93 |
| MATH Lvl 5 (4-Shot) |  4.23 |
| GPQA (0-Shot)       |  2.24 |
| MuSR (0-Shot)       |  6.44 |
| MMLU-PRO (5-Shot)   | 12.41 |
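A quick sanity check, assuming the Avg. row is the unweighted mean of the six benchmark scores:

```python
# Scores copied from the table above.
scores = {
    "IFEval": 43.52,
    "BBH": 12.93,
    "MATH Lvl 5": 4.23,
    "GPQA": 2.24,
    "MuSR": 6.44,
    "MMLU-PRO": 12.41,
}

avg = sum(scores.values()) / len(scores)
print(round(avg, 2))  # 13.63
```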