LongshenOu committed on
Commit 75589c7
1 Parent(s): efa1857

End of training

README.md CHANGED
@@ -12,6 +12,8 @@ should probably proofread and complete it, then remove this comment. -->
 # m2m_pt
 
 This model was trained from scratch on an unknown dataset.
+It achieves the following results on the evaluation set:
+- Loss: 0.4682
 
 ## Model description
 
@@ -40,10 +42,22 @@ The following hyperparameters were used during training:
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_steps: 1000
 - num_epochs: 1
-- mixed_precision_training: Native AMP
 
 ### Training results
 
+| Training Loss | Epoch | Step  | Validation Loss |
+|:-------------:|:-----:|:-----:|:---------------:|
+| 0.7388        | 0.09  | 2000  | 0.6464          |
+| 0.6096        | 0.18  | 4000  | 0.5477          |
+| 0.612         | 0.27  | 6000  | 0.5144          |
+| 0.53          | 0.36  | 8000  | 0.4965          |
+| 0.5744        | 0.45  | 10000 | 0.4856          |
+| 0.5435        | 0.54  | 12000 | 0.4776          |
+| 0.5428        | 0.63  | 14000 | 0.4726          |
+| 0.5065        | 0.72  | 16000 | 0.4696          |
+| 0.5287        | 0.81  | 18000 | 0.4685          |
+| 0.5032        | 0.9   | 20000 | 0.4682          |
+| 0.5451        | 0.99  | 22000 | 0.4682          |
 
 
 ### Framework versions
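
Note on the training results added in this commit: the validation loss flattens out toward the end of the epoch. The sketch below is not part of the commit. It copies the Step and Validation Loss columns from the table and prints the change between consecutive evaluations. The names `steps`, `val_loss`, and `deltas` are illustrative only.

```python
# Step and Validation Loss values copied from the "Training results" table.
steps = [2000, 4000, 6000, 8000, 10000, 12000,
         14000, 16000, 18000, 20000, 22000]
val_loss = [0.6464, 0.5477, 0.5144, 0.4965, 0.4856, 0.4776,
            0.4726, 0.4696, 0.4685, 0.4682, 0.4682]


def deltas(values):
    """Return the change between each pair of consecutive entries."""
    return [round(b - a, 4) for a, b in zip(values, values[1:])]


val_deltas = deltas(val_loss)

# Lowest validation loss and the first step at which it was reached.
best_loss = min(val_loss)
best_step = steps[val_loss.index(best_loss)]

for step, delta in zip(steps[1:], val_deltas):
    print(f"step {step:>5}: {delta:+.4f}")
print(f"best validation loss {best_loss} first reached at step {best_step}")
```

The lowest value it finds, 0.4682, is the same figure the README reports as the evaluation loss.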
model.safetensors CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:fe5962587c35e01e9037daa790dc5f86f180bcf143642595468ec30f0d9d2234
+oid sha256:c74187603ec6a569597054dbd97d8e1b1f82e36085f9ebdb8696060eef728a79
 size 174791872
runs/Jun20_10-48-43_smc-gpu3/events.out.tfevents.1718880526.smc-gpu3.225838.0 CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:c79af74825517d9ddc1e0beb5f6b1d756d2c79236ee1d77e826666d0515f9c4f
-size 940659
+oid sha256:982c7d427043c891cc026b5cdbe8dfa6c2e976c218fb7dfb93d970acaebed1fc
+size 951124