This model is Llemma-34b model used in the paper "An Empirical Analysis of Compute-Optimal Inference for Problem-Solving with Language Models". It's based on Llemma-34b and was further finetuned MetaMath with special format for reward. Each step starts with "Step" and ends with "\u043a\u0438".

Downloads last month
174
Safetensors
Model size
33.7B params
Tensor type
BF16
·
Inference API
Unable to determine this model's library. Check the docs .

Collection including tkitsers/Llemma-metamath-34b