Konstantinos committed: Update README.md
README.md
CHANGED
@@ -46,7 +46,7 @@ language: el
 
 ## Training details:
 
-The current snapshot has been trained for 40hrs with
+The current snapshot has been trained for 40hrs with an RTX A6000 GPU (48G), using the `galore_adamw8bit_per_layer` optimizer by Zhao et. al [1] and a context size of 1024 tokens.
 
 
 ## Dataset: