End of training
Browse files
README.md
CHANGED
@@ -44,27 +44,28 @@ The following hyperparameters were used during training:
|
|
44 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
45 |
- lr_scheduler_type: cosine
|
46 |
- lr_scheduler_warmup_steps: 30
|
47 |
-
- training_steps:
|
48 |
|
49 |
### Training results
|
50 |
|
51 |
| Training Loss | Epoch | Step | Validation Loss |
|
52 |
|:-------------:|:------:|:----:|:---------------:|
|
53 |
-
| 2.1783 | 0.0625 | 100 | 2.
|
54 |
-
| 1.
|
55 |
-
| 1.
|
56 |
-
| 1.
|
57 |
-
| 1.
|
58 |
-
| 1.
|
59 |
-
| 1.
|
60 |
-
| 1.
|
61 |
-
| 1.
|
62 |
-
| 1.
|
63 |
-
| 1.
|
64 |
-
| 1.
|
65 |
-
| 1.
|
66 |
-
| 1.
|
67 |
-
| 1.
|
|
|
68 |
|
69 |
|
70 |
### Framework versions
|
|
|
44 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
45 |
- lr_scheduler_type: cosine
|
46 |
- lr_scheduler_warmup_steps: 30
|
47 |
+
- training_steps: 1600
|
48 |
|
49 |
### Training results
|
50 |
|
51 |
| Training Loss | Epoch | Step | Validation Loss |
|
52 |
|:-------------:|:------:|:----:|:---------------:|
|
53 |
+
| 2.1783 | 0.0625 | 100 | 2.1006 |
|
54 |
+
| 1.8426 | 0.125 | 200 | 1.8535 |
|
55 |
+
| 1.7343 | 0.1875 | 300 | 1.7350 |
|
56 |
+
| 1.6313 | 0.25 | 400 | 1.6520 |
|
57 |
+
| 1.5817 | 0.3125 | 500 | 1.5982 |
|
58 |
+
| 1.5498 | 0.375 | 600 | 1.5604 |
|
59 |
+
| 1.5019 | 0.4375 | 700 | 1.5322 |
|
60 |
+
| 1.4852 | 0.5 | 800 | 1.5103 |
|
61 |
+
| 1.461 | 0.5625 | 900 | 1.4939 |
|
62 |
+
| 1.4483 | 0.625 | 1000 | 1.4820 |
|
63 |
+
| 1.4434 | 0.6875 | 1100 | 1.4723 |
|
64 |
+
| 1.4254 | 0.75 | 1200 | 1.4659 |
|
65 |
+
| 1.4224 | 0.8125 | 1300 | 1.4619 |
|
66 |
+
| 1.4188 | 0.875 | 1400 | 1.4596 |
|
67 |
+
| 1.4245 | 0.9375 | 1500 | 1.4585 |
|
68 |
+
| 1.4172 | 1.0 | 1600 | 1.4578 |
|
69 |
|
70 |
|
71 |
### Framework versions
|