Base model: open-thoughts/OpenThinker-7B
SFT model: yufeng1/OpenThinker-7B-reasoning-full-lora-max-type3-e5
Merge method: slerp
Alpha: 0.25
Validation reasoning rate: 98.67
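
For readers unfamiliar with the merge method named above, the following is a minimal sketch of spherical linear interpolation (slerp) applied to a single pair of weight arrays. It is not the script used to produce this model: the function name `slerp`, the `eps` threshold, and the convention that alpha = 0 returns the base weights and alpha = 1 returns the SFT weights are all assumptions made for illustration.

```python
import numpy as np

def slerp(w_base, w_sft, alpha, eps=1e-8):
    """Spherically interpolate between two weight arrays of the same shape.

    Assumed convention (not confirmed by the model card):
    alpha = 0 gives w_base, alpha = 1 gives w_sft.
    """
    a = np.asarray(w_base, dtype=np.float64).ravel()
    b = np.asarray(w_sft, dtype=np.float64).ravel()

    # Angle between the two flattened weight vectors.
    a_unit = a / (np.linalg.norm(a) + eps)
    b_unit = b / (np.linalg.norm(b) + eps)
    dot = np.clip(np.dot(a_unit, b_unit), -1.0, 1.0)
    theta = np.arccos(dot)
    sin_theta = np.sin(theta)

    if sin_theta < eps:
        # Vectors are (anti)parallel: fall back to linear interpolation.
        merged = (1.0 - alpha) * a + alpha * b
    else:
        coeff_a = np.sin((1.0 - alpha) * theta) / sin_theta
        coeff_b = np.sin(alpha * theta) / sin_theta
        merged = coeff_a * a + coeff_b * b

    return merged.reshape(np.shape(w_base))

# Toy example with the alpha value listed above.
base = np.array([1.0, 0.0])
sft = np.array([0.0, 1.0])
merged = slerp(base, sft, alpha=0.25)
```

In a full merge this interpolation would typically be applied tensor by tensor across the two checkpoints; under the assumed convention, alpha = 0.25 keeps the result closer to the base model than to the SFT model.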