Training in progress, epoch 0

Files changed (7) hide show

README.md CHANGED Viewed

@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [MCG-NJU/videomae-large](https://huggingface.co/MCG-NJU/videomae-large) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.4257
 ## Model description
@@ -42,17 +42,22 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_ratio: 0.1
-- training_steps: 2235
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 0.7063        | 0.2   | 447  | 0.6715          |
-| 0.6721        | 1.2   | 894  | 0.6247          |
-| 0.4569        | 2.2   | 1341 | 0.6014          |
-| 0.3716        | 3.2   | 1788 | 0.5119          |
-| 0.3029        | 4.2   | 2235 | 0.4187          |
 ### Framework versions

 This model is a fine-tuned version of [MCG-NJU/videomae-large](https://huggingface.co/MCG-NJU/videomae-large) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.3932
 ## Model description
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_ratio: 0.1
+- training_steps: 4470
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 0.6361        | 0.1   | 447  | 0.6478          |
+| 0.6774        | 1.1   | 894  | 0.6047          |
+| 0.4168        | 2.1   | 1341 | 0.4852          |
+| 0.4427        | 3.1   | 1788 | 0.8547          |
+| 0.4496        | 4.1   | 2235 | 0.3795          |
+| 0.3433        | 5.1   | 2682 | 0.4119          |
+| 0.2287        | 6.1   | 3129 | 0.4823          |
+| 0.1297        | 7.1   | 3576 | 0.4295          |
+| 0.3104        | 8.1   | 4023 | 0.4096          |
+| 0.0525        | 9.1   | 4470 | 0.4154          |
 ### Framework versions

all_results.json CHANGED Viewed

@@ -1,7 +1,7 @@
 {
-    "epoch": 4.2,
-    "eval_loss": 0.42570874094963074,
-    "eval_runtime": 1996.1684,
-    "eval_samples_per_second": 9.919,
-    "eval_steps_per_second": 1.24
 }

 {
+    "epoch": 9.1,
+    "eval_loss": 0.39315125346183777,
+    "eval_runtime": 426.8234,
+    "eval_samples_per_second": 2.577,
+    "eval_steps_per_second": 0.323
 }

config.json CHANGED Viewed

@@ -27,7 +27,7 @@
   "norm_pix_loss": true,
   "num_attention_heads": 16,
   "num_channels": 3,
-  "num_frames": 16,
   "num_hidden_layers": 24,
   "patch_size": 16,
   "problem_type": "single_label_classification",

   "norm_pix_loss": true,
   "num_attention_heads": 16,
   "num_channels": 3,
+  "num_frames": 24,
   "num_hidden_layers": 24,
   "patch_size": 16,
   "problem_type": "single_label_classification",

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:f8a5a5fd53f2332c3a406e96d82b4a205141b32a23899d61efc041af6b2db3c9
-size 607770476

 version https://git-lfs.github.com/spec/v1
+oid sha256:d9b2a820649ec150c6b3f8624c614c25ba4b58e6c157423f294589a1b5ba3168
+size 1215496248

test_results.json CHANGED Viewed

@@ -1,7 +1,7 @@
 {
-    "epoch": 4.2,
-    "eval_loss": 0.42570874094963074,
-    "eval_runtime": 1996.1684,
-    "eval_samples_per_second": 9.919,
-    "eval_steps_per_second": 1.24
 }

 {
+    "epoch": 9.1,
+    "eval_loss": 0.39315125346183777,
+    "eval_runtime": 426.8234,
+    "eval_samples_per_second": 2.577,
+    "eval_steps_per_second": 0.323
 }

trainer_state.json CHANGED Viewed

The diff for this file is too large to render. See raw diff

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:01a112613c865f994cbccea1bcf9259c90dca35ee00515fe413ecacd862d2da9
 size 5240

 version https://git-lfs.github.com/spec/v1
+oid sha256:e962655c185a779d829f8a4f97e6fb4b2e6c32bcc264b8fb531c5d5599c6d579
 size 5240