Commit d809af3
Parent(s): fc46879
Update README.md
README.md CHANGED

@@ -17,7 +17,7 @@ datasets:
 
 ## Model Details
 
-**Arithmo2-7B** is improved version of [
+**Arithmo2-7B** is an improved version of the [Arithmo-Mistral-7B](https://huggingface.co/akjindal53244/Arithmo-Mistral-7B) model. It is trained to reason through and answer mathematical problems, and is also capable of writing a Python program that, upon execution, prints the answer to the question. We used [Mistral-7B](https://huggingface.co/mistralai/Mistral-7B-v0.1) as the base model and used **QLoRA to fine-tune it on a single GPU**.
 
 ### Model Description
 
@@ -30,7 +30,7 @@ datasets:
 
 ## Results
 
-Arithmo2-7B is improved version of [
+Arithmo2-7B is an improved version of the [Arithmo-Mistral-7B](https://huggingface.co/akjindal53244/Arithmo-Mistral-7B) model and is competitive with fully fine-tuned state-of-the-art 7B mathematical-reasoning models. Refer to the [Comparing Arithmo-Mistral-7B with other LLM models](https://github.com/akjindal53244/Arithmo-Mistral-7B/tree/master#comparing-arithmo-mistral-7b-with-other-llm-models) section for more details.
 
 <table>
 <thead>
@@ -102,7 +102,7 @@ Plugging these values into the formula, we get:
 The answer is: 55
 ```
 
-
+Arithmo2-7B is trained with the following format:
 #### CoT Format (generate reasoning steps with answer):
 ```
 Question: <question>
@@ -118,7 +118,7 @@ Answer:
 ```
 It will perform best if queried in this way with your own script.
 
-## Comparing
+## Comparing Arithmo2-7B with other LLM models
 Results for all models except `Arithmo2-7B` are taken from the [MetaMath](https://github.com/meta-math/MetaMath/blob/main/README.MD) repository.
 
 | Model | GSM8k Pass@1 | MATH Pass@1 | Fine-tuning |
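The CoT prompt format this commit documents is a plain string template: the question on one line, then an `Answer:` cue for the model to complete. A minimal sketch of building it in Python (the helper name `build_cot_prompt` and the exact blank-line spacing are assumptions inferred from the snippet in the diff, not confirmed by the model card):

```python
def build_cot_prompt(question: str) -> str:
    """Build a chain-of-thought prompt in the format the README shows:
    'Question: <question>', a blank line, then a trailing 'Answer:' cue.
    The exact whitespace between the two parts is an assumption."""
    return f"Question: {question}\n\nAnswer:"


# Example: the resulting string is what you would pass to your own
# generation script, as the README recommends.
prompt = build_cot_prompt("What is 15% of 200?")
print(prompt)
```

Querying the model with a string built this way matches the "It will perform best if queried in this way with your own script" note above.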