Update README.md (#1)
- Update README.md (fce00e1c53390581af7daa2a83a3c996e2b712d3)
Co-authored-by: Jim Burtoft <jburtoft@users.noreply.huggingface.co>
README.md
CHANGED
@@ -9,7 +9,7 @@ This model experiment was inspired by the work published in [Goat: Fine-tuned LL

 Fine tuning of [philschmid/Llama-2-7b-hf](https://huggingface.co/philschmid/Llama-2-7b-hf) was conducted with 2.8M math problems from the [AtlasUnified/atlas-math-sets](https://huggingface.co/datasets/AtlasUnified/atlas-math-sets) dataset.

-Training was conducted on a trn1.32xlarge instance. The model here was
+Training was conducted on a trn1.32xlarge instance. The model here was compiled for 2 Neuron cores, which will run on AWS inf2.8xlarge and larger instances.

 The table below highlights performance improvements observed on basic math questions.

@@ -37,4 +37,4 @@ p = pipeline('text-generation', model="hartmani/llama2-7B-AtlasMath-2.8M", token
 p("What is the addition of 591 + 642", max_new_tokens=64, do_sample=True, top_k=50)
 ```

-**Disclaimer:** *This model was generated for research purposes only and may produce inconsistent or inaccurate results. There are obviously far better ways to have computers perform basic math calculations. This model simply demonstrates the ease of teaching a Llama2-7B model
+**Disclaimer:** *This model was generated for research purposes only and may produce inconsistent or inaccurate results. There are obviously far better ways to have computers perform basic math calculations. This model simply demonstrates the ease of teaching a Llama2-7B model basic math techniques.*