Update README.md
Browse files
README.md
CHANGED
@@ -55,7 +55,7 @@ The optimizer used is AdaFactor with inverse square root learning rate schedule
|
|
55 |
|
56 |
### Fine-tuning
|
57 |
|
58 |
-
This model was then fine-tuned on a single TPU Pod V2-8 for 500 steps in total, using sequence length 512 (batch size 256), using only the dataset only containing
|
59 |
|
60 |
|
61 |
## Evaluation results
|
|
|
55 |
|
56 |
### Fine-tuning
|
57 |
|
58 |
+
This model was then fine-tuned on a single TPU Pod V2-8 for 500 steps in total, using sequence length 512 (batch size 256), using only the dataset only containing sql code.
|
59 |
|
60 |
|
61 |
## Evaluation results
|