manojpreveen committed a940cbb (1 parent: 0e9cf15)
Update README.md
README.md CHANGED
@@ -9,7 +9,7 @@ Instruction Tuned GPT-NeoXT-20B model on Instruction Tuning dataset as listed be
 **Base Model:** togethercomputer/GPT-NeoXT-Chat-Base-20B (GPT-NeoXT-Chat-Base-20B-v0.16 - fine-tuned on feedback data)
 
 **Training Details :**
-* Epochs:
+* Epochs: 4
 * Batch Size : 5 instantaneous per device x 1 gradient accumulation steps x 8 gpus = 40
 * Block Size : 2020
 * Weight Decay : 0
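
The batch-size line in the README is a product of three factors. As a quick check, the arithmetic can be sketched as below. The variable names are illustrative and not taken from the training script, and treating "Block Size" as the maximum number of tokens per sequence is an assumption.

```python
# Values copied from the README's "Training Details" section.
# Variable names are hypothetical; they are not from the actual training script.
per_device_batch_size = 5
gradient_accumulation_steps = 1
num_gpus = 8
block_size = 2020  # assumed to be the maximum tokens per training sequence

# Sequences consumed per optimizer step across all devices.
effective_batch_size = per_device_batch_size * gradient_accumulation_steps * num_gpus

# Upper bound on tokens per optimizer step, under the block-size assumption above.
max_tokens_per_step = effective_batch_size * block_size

print(effective_batch_size)  # 40
print(max_tokens_per_step)   # 80800
```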