vivek-307306 commited on
Commit
d5dc7a8
1 Parent(s): a7ded09

update model card README.md

Browse files
Files changed (1) hide show
  1. README.md +57 -8
README.md CHANGED
@@ -22,7 +22,7 @@ model-index:
22
  metrics:
23
  - name: Bleu
24
  type: bleu
25
- value: 36.93684031225943
26
  ---
27
 
28
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -32,8 +32,8 @@ should probably proofread and complete it, then remove this comment. -->
32
 
33
  This model is a fine-tuned version of [facebook/m2m100_418M](https://huggingface.co/facebook/m2m100_418M) on the kde4 dataset.
34
  It achieves the following results on the evaluation set:
35
- - Loss: 0.9177
36
- - Bleu: 36.9368
37
 
38
  ## Model description
39
 
@@ -52,20 +52,69 @@ More information needed
52
  ### Training hyperparameters
53
 
54
  The following hyperparameters were used during training:
55
- - learning_rate: 2e-05
56
  - train_batch_size: 2
57
  - eval_batch_size: 2
58
  - seed: 42
59
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
60
  - lr_scheduler_type: linear
61
- - num_epochs: 1
62
  - mixed_precision_training: Native AMP
63
 
64
  ### Training results
65
 
66
- | Training Loss | Epoch | Step | Validation Loss | Bleu |
67
- |:-------------:|:-----:|:-----:|:---------------:|:-------:|
68
- | 0.8797 | 1.0 | 59084 | 0.9177 | 36.9368 |
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
69
 
70
 
71
  ### Framework versions
 
22
  metrics:
23
  - name: Bleu
24
  type: bleu
25
+ value: 0.0
26
  ---
27
 
28
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 
32
 
33
  This model is a fine-tuned version of [facebook/m2m100_418M](https://huggingface.co/facebook/m2m100_418M) on the kde4 dataset.
34
  It achieves the following results on the evaluation set:
35
+ - Loss: nan
36
+ - Bleu: 0.0
37
 
38
  ## Model description
39
 
 
52
  ### Training hyperparameters
53
 
54
  The following hyperparameters were used during training:
55
+ - learning_rate: 0.002
56
  - train_batch_size: 2
57
  - eval_batch_size: 2
58
  - seed: 42
59
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
60
  - lr_scheduler_type: linear
61
+ - num_epochs: 50
62
  - mixed_precision_training: Native AMP
63
 
64
  ### Training results
65
 
66
+ | Training Loss | Epoch | Step | Validation Loss | Bleu |
67
+ |:-------------:|:-----:|:-------:|:---------------:|:----:|
68
+ | 0.0 | 1.0 | 59084 | nan | 0.0 |
69
+ | 0.0 | 2.0 | 118168 | nan | 0.0 |
70
+ | 0.0 | 3.0 | 177252 | nan | 0.0 |
71
+ | 0.0 | 4.0 | 236336 | nan | 0.0 |
72
+ | 0.0 | 5.0 | 295420 | nan | 0.0 |
73
+ | 0.0 | 6.0 | 354504 | nan | 0.0 |
74
+ | 0.0 | 7.0 | 413588 | nan | 0.0 |
75
+ | 0.0 | 8.0 | 472672 | nan | 0.0 |
76
+ | 0.0 | 9.0 | 531756 | nan | 0.0 |
77
+ | 0.0 | 10.0 | 590840 | nan | 0.0 |
78
+ | 0.0 | 11.0 | 649924 | nan | 0.0 |
79
+ | 0.0 | 12.0 | 709008 | nan | 0.0 |
80
+ | 0.0 | 13.0 | 768092 | nan | 0.0 |
81
+ | 0.0 | 14.0 | 827176 | nan | 0.0 |
82
+ | 0.0 | 15.0 | 886260 | nan | 0.0 |
83
+ | 0.0 | 16.0 | 945344 | nan | 0.0 |
84
+ | 0.0 | 17.0 | 1004428 | nan | 0.0 |
85
+ | 0.0 | 18.0 | 1063512 | nan | 0.0 |
86
+ | 0.0 | 19.0 | 1122596 | nan | 0.0 |
87
+ | 0.0 | 20.0 | 1181680 | nan | 0.0 |
88
+ | 0.0 | 21.0 | 1240764 | nan | 0.0 |
89
+ | 0.0 | 22.0 | 1299848 | nan | 0.0 |
90
+ | 0.0 | 23.0 | 1358932 | nan | 0.0 |
91
+ | 0.0 | 24.0 | 1418016 | nan | 0.0 |
92
+ | 0.0 | 25.0 | 1477100 | nan | 0.0 |
93
+ | 0.0 | 26.0 | 1536184 | nan | 0.0 |
94
+ | 0.0 | 27.0 | 1595268 | nan | 0.0 |
95
+ | 0.0 | 28.0 | 1654352 | nan | 0.0 |
96
+ | 0.0 | 29.0 | 1713436 | nan | 0.0 |
97
+ | 0.0 | 30.0 | 1772520 | nan | 0.0 |
98
+ | 0.0 | 31.0 | 1831604 | nan | 0.0 |
99
+ | 0.0 | 32.0 | 1890688 | nan | 0.0 |
100
+ | 0.0 | 33.0 | 1949772 | nan | 0.0 |
101
+ | 0.0 | 34.0 | 2008856 | nan | 0.0 |
102
+ | 0.0 | 35.0 | 2067940 | nan | 0.0 |
103
+ | 0.0 | 36.0 | 2127024 | nan | 0.0 |
104
+ | 0.0 | 37.0 | 2186108 | nan | 0.0 |
105
+ | 0.0 | 38.0 | 2245192 | nan | 0.0 |
106
+ | 0.0 | 39.0 | 2304276 | nan | 0.0 |
107
+ | 0.0 | 40.0 | 2363360 | nan | 0.0 |
108
+ | 0.0 | 41.0 | 2422444 | nan | 0.0 |
109
+ | 0.0 | 42.0 | 2481528 | nan | 0.0 |
110
+ | 0.0 | 43.0 | 2540612 | nan | 0.0 |
111
+ | 0.0 | 44.0 | 2599696 | nan | 0.0 |
112
+ | 0.0 | 45.0 | 2658780 | nan | 0.0 |
113
+ | 0.0 | 46.0 | 2717864 | nan | 0.0 |
114
+ | 0.0 | 47.0 | 2776948 | nan | 0.0 |
115
+ | 0.0 | 48.0 | 2836032 | nan | 0.0 |
116
+ | 0.0 | 49.0 | 2895116 | nan | 0.0 |
117
+ | 0.0 | 50.0 | 2954200 | nan | 0.0 |
118
 
119
 
120
  ### Framework versions