drewcurran committed on
Commit
badaa15
1 Parent(s): 406b979

End of training

README.md CHANGED
@@ -15,9 +15,9 @@ should probably proofread and complete it, then remove this comment. -->
 
  This model was trained from scratch on an unknown dataset.
  It achieves the following results on the evaluation set:
- - Loss: 1.8461
- - Bleu: 1.3828
- - Gen Len: 18.0112
+ - Loss: 1.8172
+ - Bleu: 1.4433
+ - Gen Len: 18.005
 
  ## Model description
 
@@ -42,16 +42,15 @@ The following hyperparameters were used during training:
  - seed: 42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
- - num_epochs: 3
+ - num_epochs: 2
  - mixed_precision_training: Native AMP
 
  ### Training results
 
- | Training Loss | Epoch | Step | Validation Loss | Bleu | Gen Len |
- |:-------------:|:-----:|:-----:|:---------------:|:------:|:-------:|
- | 2.1283 | 1.0 | 4674 | 1.8698 | 1.3343 | 18.024 |
- | 2.1211 | 2.0 | 9348 | 1.8516 | 1.3656 | 18.0194 |
- | 2.1063 | 3.0 | 14022 | 1.8461 | 1.3828 | 18.0112 |
+ | Training Loss | Epoch | Step | Validation Loss | Bleu | Gen Len |
+ |:-------------:|:-----:|:----:|:---------------:|:------:|:-------:|
+ | 2.0545 | 1.0 | 4674 | 1.8267 | 1.4183 | 18.0116 |
+ | 2.0645 | 2.0 | 9348 | 1.8172 | 1.4433 | 18.005 |
 
 
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:febb4d6a5a1788f89efcab8abfc36fc172769ff9d3932ecc7e4ddf4102b95161
+ oid sha256:c8115546398ac76ac46777e086f64138e166fd4140217fbe15347862dc4438c8
  size 242041896
runs/May07_07-56-44_366c39a864bc/events.out.tfevents.1715068609.366c39a864bc.1848.6 CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:fde133dedd328e0cbfa61e456e2b82a9342a2a42c00dba4a0ecabea424cd1848
- size 9904
+ oid sha256:b5425f6da4e53f1c81de59b2ab1ba70f7f8a486562d60adb7581f510a6b8eab6
+ size 10628