laurentiubp commited on
Commit
0f82ac1
1 Parent(s): 377f50a

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -1
README.md CHANGED
@@ -61,6 +61,8 @@ print(outputs[0]["generated_text"][len(prompt):])
61
 
62
  The model was trained **without a prompt template**, only with raw text separated by BOS and EOS tokens.
63
 
 
 
64
  Example:
65
 
66
  ```text
@@ -105,4 +107,4 @@ The following hyperparameters were used during training:
105
 
106
  **Out-of-scope** Use in any manner that violates applicable laws or regulations (including trade compliance laws). Use in any other way that is prohibited by the Acceptable Use Policy and Llama 3 Community License. Use in languages other than English**.
107
 
108
- **Note: Developers may fine-tune Llama 3 models for languages beyond English provided they comply with the Llama 3 Community License and the Acceptable Use Policy.
 
61
 
62
  The model was trained **without a prompt template**, only with raw text separated by BOS and EOS tokens.
63
 
64
+ The model was trained for two epochs on **6x A100 80GB GPUs using DeepSpeed ZeRO** State-3 without CPU offloading.
65
+
66
  Example:
67
 
68
  ```text
 
107
 
108
  **Out-of-scope** Use in any manner that violates applicable laws or regulations (including trade compliance laws). Use in any other way that is prohibited by the Acceptable Use Policy and Llama 3 Community License. Use in languages other than English**.
109
 
110
+ **Note: Developers may fine-tune Llama 3 models for languages beyond English provided they comply with the Llama 3 Community License and the Acceptable Use Policy.