sanchit-gandhi HF staff commited on
Commit
9505d95
1 Parent(s): 18da6e8

update model card README.md

Browse files
README.md CHANGED
@@ -17,8 +17,8 @@ should probably proofread and complete it, then remove this comment. -->
17
 
18
  This model was trained from scratch on the xtreme_s dataset.
19
  It achieves the following results on the evaluation set:
 
20
  - Bleu: 0.0000
21
- - Loss: 1.6425
22
 
23
  ## Model description
24
 
@@ -37,12 +37,12 @@ More information needed
37
  ### Training hyperparameters
38
 
39
  The following hyperparameters were used during training:
40
- - learning_rate: 0.00023509256443134124
41
  - train_batch_size: 8
42
  - eval_batch_size: 8
43
  - seed: 42
44
- - gradient_accumulation_steps: 8
45
- - total_train_batch_size: 64
46
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
47
  - lr_scheduler_type: linear
48
  - lr_scheduler_warmup_steps: 500
@@ -51,27 +51,17 @@ The following hyperparameters were used during training:
51
 
52
  ### Training results
53
 
54
- | Training Loss | Epoch | Step | Bleu | Validation Loss |
55
- |:-------------:|:-----:|:----:|:------:|:---------------:|
56
- | 2.8303 | 0.15 | 500 | 0.0 | 4.9238 |
57
- | 2.4062 | 0.31 | 1000 | 0.0 | 4.4017 |
58
- | 1.9171 | 0.46 | 1500 | 0.0000 | 3.6431 |
59
- | 1.4558 | 0.62 | 2000 | 0.0000 | 2.8292 |
60
- | 1.2393 | 0.77 | 2500 | 0.0000 | 2.3704 |
61
- | 1.3315 | 0.93 | 3000 | 0.0000 | 2.3101 |
62
- | 1.8476 | 1.08 | 3500 | 0.0000 | 1.9936 |
63
- | 1.683 | 1.23 | 4000 | 0.0000 | 1.9308 |
64
- | 1.8298 | 1.39 | 4500 | 0.0000 | 1.8817 |
65
- | 1.5955 | 1.54 | 5000 | 0.0000 | 1.8171 |
66
- | 1.6288 | 1.7 | 5500 | 0.0000 | 1.7821 |
67
- | 1.4107 | 1.85 | 6000 | 0.0000 | 1.7170 |
68
- | 1.0363 | 2.01 | 6500 | 0.0000 | 1.7419 |
69
- | 0.9667 | 2.16 | 7000 | 0.0000 | 1.7309 |
70
- | 0.9147 | 2.31 | 7500 | 0.0000 | 1.7244 |
71
- | 1.1975 | 2.47 | 8000 | 0.0000 | 1.6716 |
72
- | 0.9071 | 2.62 | 8500 | 0.0000 | 1.6886 |
73
- | 0.9735 | 2.78 | 9000 | 0.0000 | 1.6609 |
74
- | 0.908 | 2.93 | 9500 | 0.0000 | 1.6425 |
75
 
76
 
77
  ### Framework versions
 
17
 
18
  This model was trained from scratch on the xtreme_s dataset.
19
  It achieves the following results on the evaluation set:
20
+ - Loss: 1.7768
21
  - Bleu: 0.0000
 
22
 
23
  ## Model description
24
 
 
37
  ### Training hyperparameters
38
 
39
  The following hyperparameters were used during training:
40
+ - learning_rate: 0.0003
41
  - train_batch_size: 8
42
  - eval_batch_size: 8
43
  - seed: 42
44
+ - gradient_accumulation_steps: 16
45
+ - total_train_batch_size: 128
46
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
47
  - lr_scheduler_type: linear
48
  - lr_scheduler_warmup_steps: 500
 
51
 
52
  ### Training results
53
 
54
+ | Training Loss | Epoch | Step | Validation Loss | Bleu |
55
+ |:-------------:|:-----:|:----:|:---------------:|:------:|
56
+ | 2.5511 | 0.31 | 500 | 5.1039 | 0.0 |
57
+ | 2.2033 | 0.62 | 1000 | 4.1782 | 0.0000 |
58
+ | 1.4703 | 0.93 | 1500 | 2.8979 | 0.0000 |
59
+ | 1.6507 | 1.23 | 2000 | 2.2250 | 0.0000 |
60
+ | 1.6791 | 1.54 | 2500 | 2.0530 | 0.0000 |
61
+ | 1.4587 | 1.85 | 3000 | 1.9121 | 0.0000 |
62
+ | 1.288 | 2.16 | 3500 | 1.8705 | 0.0000 |
63
+ | 1.2244 | 2.47 | 4000 | 1.7940 | 0.0000 |
64
+ | 1.0364 | 2.78 | 4500 | 1.7768 | 0.0000 |
 
 
 
 
 
 
 
 
 
 
65
 
66
 
67
  ### Framework versions
wandb/run-20220505_173818-i9acyhfo/files/output.log CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:2192ab1040218726cf08d6108176c5547d818ea56f5f7aa9b6813a899189b0b8
3
- size 16288756
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e89684c4b6746ae738e3ddaa427cf9d90c966dd964975e6c83cc9286c986321c
3
+ size 16290323
wandb/run-20220505_173818-i9acyhfo/logs/debug-internal.log CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:e883f57d9e77232f2cf3a3fde646766c69e97db702e1b1a7c7d12e4b494f669e
3
- size 14780150
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:51104230cbff36190fb046f99c188978d0072b182cec0fb29ce099113c8c84d1
3
+ size 14783102
wandb/run-20220505_173818-i9acyhfo/run-i9acyhfo.wandb CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:654fb5c5647c98ae01af1ccbafa2354d65d474f4c254d33a30c76dcf57ab6ee4
3
- size 1045114644
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:19b65ff5e38c62ffeab088591c02c2726ee560f2f55bbf6e994f8ca1036f5c6a
3
+ size 1045118132