Ragab167 commited on
Commit
5120151
1 Parent(s): e3abaab

End of training

Browse files
Files changed (3) hide show
  1. README.md +10 -8
  2. model.safetensors +1 -1
  3. training_args.bin +1 -1
README.md CHANGED
@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
15
 
16
  This model is a fine-tuned version of [facebook/m2m100_418M](https://huggingface.co/facebook/m2m100_418M) on the None dataset.
17
  It achieves the following results on the evaluation set:
18
- - Loss: 0.6403
19
 
20
  ## Model description
21
 
@@ -36,21 +36,23 @@ More information needed
36
  The following hyperparameters were used during training:
37
  - learning_rate: 5e-05
38
  - train_batch_size: 64
39
- - eval_batch_size: 64
40
  - seed: 42
41
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
42
  - lr_scheduler_type: linear
43
- - num_epochs: 3
44
 
45
  ### Training results
46
 
47
  | Training Loss | Epoch | Step | Validation Loss |
48
  |:-------------:|:-----:|:----:|:---------------:|
49
- | 3.7937 | 0.53 | 100 | 0.9563 |
50
- | 0.7881 | 1.06 | 200 | 0.6899 |
51
- | 0.617 | 1.6 | 300 | 0.6622 |
52
- | 0.6014 | 2.13 | 400 | 0.6468 |
53
- | 0.5157 | 2.66 | 500 | 0.6403 |
 
 
54
 
55
 
56
  ### Framework versions
 
15
 
16
  This model is a fine-tuned version of [facebook/m2m100_418M](https://huggingface.co/facebook/m2m100_418M) on the None dataset.
17
  It achieves the following results on the evaluation set:
18
+ - Loss: 0.6394
19
 
20
  ## Model description
21
 
 
36
  The following hyperparameters were used during training:
37
  - learning_rate: 5e-05
38
  - train_batch_size: 64
39
+ - eval_batch_size: 32
40
  - seed: 42
41
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
42
  - lr_scheduler_type: linear
43
+ - num_epochs: 4
44
 
45
  ### Training results
46
 
47
  | Training Loss | Epoch | Step | Validation Loss |
48
  |:-------------:|:-----:|:----:|:---------------:|
49
+ | 3.7525 | 0.53 | 100 | 0.9179 |
50
+ | 0.7831 | 1.06 | 200 | 0.6902 |
51
+ | 0.6143 | 1.6 | 300 | 0.6662 |
52
+ | 0.5988 | 2.13 | 400 | 0.6493 |
53
+ | 0.5075 | 2.66 | 500 | 0.6415 |
54
+ | 0.4766 | 3.19 | 600 | 0.6408 |
55
+ | 0.4445 | 3.72 | 700 | 0.6394 |
56
 
57
 
58
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:22d0f4b654df6fc49e188a220d85f62f875f8c4e7708500c3cab3d0503d0ffcd
3
  size 1935681888
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:8e819ac3a8b292cae7c145ca2936f26b903faac4248da139e34f138d85a55ed0
3
  size 1935681888
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:b823d0c5c9aac29284f1c01633432e7c8eca274f9327c7443e64fecaea1d3c82
3
  size 4984
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9a06de3b7ce2b9dc64c466884365b1102dcbce806083d307b467f1bf977ae27a
3
  size 4984