Reason/COT
Collection
12 items
•
Updated
•
3
This qwen2 model was trained 2x faster with Unsloth and Huggingface's TRL library.
Detailed results can be found here! Summarized results can be found here!
Metric | Value (%) |
---|---|
Average | 7.45 |
IFEval (0-Shot) | 25.67 |
BBH (3-Shot) | 6.55 |
MATH Lvl 5 (4-Shot) | 0.00 |
GPQA (0-shot) | 3.13 |
MuSR (0-shot) | 1.60 |
MMLU-PRO (5-shot) | 7.76 |