hartmani jburtoft commited on
Commit
ed6a4ed
1 Parent(s): ffa6fd7

Update README.md (#1)

Browse files

- Update README.md (fce00e1c53390581af7daa2a83a3c996e2b712d3)


Co-authored-by: Jim Burtoft <jburtoft@users.noreply.huggingface.co>

Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -9,7 +9,7 @@ This model experiment was inspired by the work published in [Goat: Fine-tuned LL
9
 
10
  Fine tuning of [philschmid/Llama-2-7b-hf](https://huggingface.co/philschmid/Llama-2-7b-hf) was conducted with 2.8M math problems from the [AtlasUnified/atlas-math-sets](https://huggingface.co/datasets/AtlasUnified/atlas-math-sets) dataset.
11
 
12
- Training was conducted on a trn1.32xlarge instance. The model here was complied for 2 Neuron cores, which will run on AWS inf2.8xlarge and larger instances.
13
 
14
  The table below highlights performance improvements observed on basic math questions.
15
 
@@ -37,4 +37,4 @@ p = pipeline('text-generation', model="hartmani/llama2-7B-AtlasMath-2.8M", token
37
  p("What is the addition of 591 + 642", max_new_tokens=64, do_sample=True, top_k=50)
38
  ```
39
 
40
- **Disclaimer:** *This model was generated for research purposes only and may produce inconsistent or inaccurate results. There are obviously far better ways to have computers perform basic math calculations. This model simply demonstrates the ease of teaching a Llama2-7B model masic math techniques.*
 
9
 
10
  Fine tuning of [philschmid/Llama-2-7b-hf](https://huggingface.co/philschmid/Llama-2-7b-hf) was conducted with 2.8M math problems from the [AtlasUnified/atlas-math-sets](https://huggingface.co/datasets/AtlasUnified/atlas-math-sets) dataset.
11
 
12
+ Training was conducted on a trn1.32xlarge instance. The model here was compiled for 2 Neuron cores, which will run on AWS inf2.8xlarge and larger instances.
13
 
14
  The table below highlights performance improvements observed on basic math questions.
15
 
 
37
  p("What is the addition of 591 + 642", max_new_tokens=64, do_sample=True, top_k=50)
38
  ```
39
 
40
+ **Disclaimer:** *This model was generated for research purposes only and may produce inconsistent or inaccurate results. There are obviously far better ways to have computers perform basic math calculations. This model simply demonstrates the ease of teaching a Llama2-7B model basic math techniques.*