Shresthadev403 committed on
Commit
dab44ff
1 Parent(s): a3aaac8

End of training

README.md CHANGED
@@ -13,12 +13,7 @@ should probably proofread and complete it, then remove this comment. -->
 
  This model was trained from scratch on an unknown dataset.
  It achieves the following results on the evaluation set:
- - eval_loss: 4.6987
- - eval_runtime: 0.0184
- - eval_samples_per_second: 54.299
- - eval_steps_per_second: 54.299
- - epoch: 50.0
- - step: 50
+ - Loss: 4.6987
 
  ## Model description
 
@@ -48,6 +43,17 @@ The following hyperparameters were used during training:
  - num_epochs: 50
  - mixed_precision_training: Native AMP
 
+ ### Training results
+
+ | Training Loss | Epoch | Step | Validation Loss |
+ |:-------------:|:-----:|:----:|:---------------:|
+ | 0.3826 | 10.0 | 10 | 4.4464 |
+ | 0.207 | 20.0 | 20 | 4.5212 |
+ | 0.1171 | 30.0 | 30 | 4.5379 |
+ | 0.0764 | 40.0 | 40 | 4.6038 |
+ | 0.063 | 50.0 | 50 | 4.6987 |
+
+
  ### Framework versions
 
  - Transformers 4.36.0
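
The "Training results" table added in this hunk logs training loss falling while validation loss rises at each evaluation, a pattern usually associated with overfitting. A minimal sketch of that reading, using only the five rows from the table (the variable names `rows`, `best_step`, `val_rising`, and `train_falling` are illustrative and not part of the commit):

```python
# Rows copied from the "Training results" table:
# (training loss, epoch, step, validation loss).
rows = [
    (0.3826, 10.0, 10, 4.4464),
    (0.207, 20.0, 20, 4.5212),
    (0.1171, 30.0, 30, 4.5379),
    (0.0764, 40.0, 40, 4.6038),
    (0.063, 50.0, 50, 4.6987),
]

# Step with the lowest validation loss among the logged evaluations.
best = min(rows, key=lambda r: r[3])
best_step = best[2]

# True when validation loss increases at every successive evaluation.
val_rising = all(a[3] < b[3] for a, b in zip(rows, rows[1:]))

# True when training loss decreases at every successive evaluation.
train_falling = all(a[0] > b[0] for a, b in zip(rows, rows[1:]))

print(best_step, val_rising, train_falling)
```

Under this reading, the earliest logged evaluation has the lowest validation loss, so the final checkpoint reported in the card is not the best one by that metric.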
logs/events.out.tfevents.1705983035.70e47a1f5afe.42.11 CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:40e3106771c36c1a648980573eab6d7576766a4a44e6df90f9c4ecf149a8fcd5
- size 7086
+ oid sha256:6a97b5844d202a7ddb4953cff5f69b653fcbf61773c37080f9b1ade5d8d08721
+ size 7434
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:45d04dad0a7c2b4a89b02a87f0f119ab92b550cd7e094ff234eb79f7dd90d2a7
+ oid sha256:b1f2ed33d49476ecdf341fac292ec64b65edf6c4f0c485b9fcbbab100d62d596
  size 435756040
trainer_state.json CHANGED
@@ -77,6 +77,15 @@
  "eval_samples_per_second": 54.299,
  "eval_steps_per_second": 54.299,
  "step": 50
+ },
+ {
+ "epoch": 50.0,
+ "step": 50,
+ "total_flos": 32856154788600.0,
+ "train_loss": 0.012607929706573486,
+ "train_runtime": 24.4386,
+ "train_samples_per_second": 18.413,
+ "train_steps_per_second": 2.046
  }
  ],
  "logging_steps": 10,
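
The summary entry added to trainer_state.json can be sanity-checked with simple arithmetic. The sketch below assumes that `train_steps_per_second` is the step count divided by `train_runtime`, and that `train_samples_per_second` times `train_runtime` approximates the total samples processed. Both are assumptions about how the values relate, not something this diff states. The `derived_` and `approx_` names are illustrative.

```python
# Values copied from the added trainer_state.json entry.
step = 50
train_runtime = 24.4386            # seconds
train_steps_per_second = 2.046
train_samples_per_second = 18.413

# Assumption: steps per second is the step count divided by the runtime.
derived_steps_per_second = step / train_runtime

# Assumption: rate * runtime approximates the total samples processed.
# The logged rate is rounded, so this is only an estimate.
approx_total_samples = train_samples_per_second * train_runtime
approx_samples_per_step = approx_total_samples / step

print(round(derived_steps_per_second, 3))
print(round(approx_total_samples), round(approx_samples_per_step))
```

If those assumptions hold, the logged rates are consistent with each other and suggest roughly nine samples per optimizer step, with one step per epoch over the 50 epochs.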