This model is Llemma-7b model used in the paper "An Empirical Analysis of Compute-Optimal Inference for Problem-Solving with Language Models". It's based on Llemma-7b and was further finetuned MetaMath with special format for reward. Each step starts with "Step" and ends with "\u043a\u0438".

Downloads last month
304
Safetensors
Model size
6.74B params
Tensor type
BF16
·
Inference API
Unable to determine this model's library. Check the docs .

Collection including tkitsers/Llemma-metamath-7b