End of training

Files changed (4) hide show

README.md CHANGED Viewed

@@ -13,12 +13,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model was trained from scratch on an unknown dataset.
 It achieves the following results on the evaluation set:
-- eval_loss: 4.6987
-- eval_runtime: 0.0184
-- eval_samples_per_second: 54.299
-- eval_steps_per_second: 54.299
-- epoch: 50.0
-- step: 50
 ## Model description
@@ -48,6 +43,17 @@ The following hyperparameters were used during training:
 - num_epochs: 50
 - mixed_precision_training: Native AMP
 ### Framework versions
 - Transformers 4.36.0

 This model was trained from scratch on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 4.6987
 ## Model description
 - num_epochs: 50
 - mixed_precision_training: Native AMP
+### Training results
+| Training Loss | Epoch | Step | Validation Loss |
+|:-------------:|:-----:|:----:|:---------------:|
+| 0.3826        | 10.0  | 10   | 4.4464          |
+| 0.207         | 20.0  | 20   | 4.5212          |
+| 0.1171        | 30.0  | 30   | 4.5379          |
+| 0.0764        | 40.0  | 40   | 4.6038          |
+| 0.063         | 50.0  | 50   | 4.6987          |
 ### Framework versions
 - Transformers 4.36.0

logs/events.out.tfevents.1705983035.70e47a1f5afe.42.11 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:40e3106771c36c1a648980573eab6d7576766a4a44e6df90f9c4ecf149a8fcd5
-size 7086

 version https://git-lfs.github.com/spec/v1
+oid sha256:6a97b5844d202a7ddb4953cff5f69b653fcbf61773c37080f9b1ade5d8d08721
+size 7434

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:45d04dad0a7c2b4a89b02a87f0f119ab92b550cd7e094ff234eb79f7dd90d2a7
 size 435756040

 version https://git-lfs.github.com/spec/v1
+oid sha256:b1f2ed33d49476ecdf341fac292ec64b65edf6c4f0c485b9fcbbab100d62d596
 size 435756040

trainer_state.json CHANGED Viewed

@@ -77,6 +77,15 @@
       "eval_samples_per_second": 54.299,
       "eval_steps_per_second": 54.299,
       "step": 50
     }
   ],
   "logging_steps": 10,

       "eval_samples_per_second": 54.299,
       "eval_steps_per_second": 54.299,
       "step": 50
+    },
+    {
+      "epoch": 50.0,
+      "step": 50,
+      "total_flos": 32856154788600.0,
+      "train_loss": 0.012607929706573486,
+      "train_runtime": 24.4386,
+      "train_samples_per_second": 18.413,
+      "train_steps_per_second": 2.046
     }
   ],
   "logging_steps": 10,