Update README.md
Browse files
README.md
CHANGED
@@ -1,3 +1,5 @@
|
|
1 |
-
---
|
2 |
-
license: apache-2.0
|
3 |
-
---
|
|
|
|
|
|
1 |
+
---
|
2 |
+
license: apache-2.0
|
3 |
+
---
|
4 |
+
This model is Llemma-34b model used in the paper ["An Empirical Analysis of Compute-Optimal Inference for Problem-Solving with Language Models"](https://arxiv.org/abs/2408.00724).
|
5 |
+
It's based on Llemma-34b and was further finetuned MetaMath with special format for reward. Each step starts with "Step" and ends with "\u043a\u0438".
|