Marcos12886 committed on Sep 4, 2024

Commit

4c99118

1 Parent(s): 969bb37

Upload folder using huggingface_hub

This view is limited to 50 files because the commit contains too many changes.

Files changed (50)

README.md +9 -33
checkpoint-111/model.safetensors +1 -1
checkpoint-111/optimizer.pt +1 -1
checkpoint-111/scheduler.pt +1 -1
checkpoint-111/trainer_state.json +33 -51
checkpoint-111/training_args.bin +1 -1
checkpoint-126/config.json +84 -0
checkpoint-126/model.safetensors +3 -0
checkpoint-126/optimizer.pt +3 -0
checkpoint-126/rng_state.pth +3 -0
checkpoint-126/scheduler.pt +3 -0
checkpoint-126/trainer_state.json +105 -0
checkpoint-126/training_args.bin +3 -0
checkpoint-18/model.safetensors +1 -1
checkpoint-18/optimizer.pt +1 -1
checkpoint-18/scheduler.pt +1 -1
checkpoint-18/trainer_state.json +8 -11
checkpoint-18/training_args.bin +1 -1
checkpoint-37/model.safetensors +1 -1
checkpoint-37/optimizer.pt +1 -1
checkpoint-37/scheduler.pt +1 -1
checkpoint-37/trainer_state.json +14 -20
checkpoint-37/training_args.bin +1 -1
checkpoint-54/model.safetensors +1 -1
checkpoint-54/optimizer.pt +1 -1
checkpoint-54/rng_state.pth +2 -2
checkpoint-54/trainer_state.json +25 -16
checkpoint-54/training_args.bin +1 -1
checkpoint-55/model.safetensors +1 -1
checkpoint-55/optimizer.pt +1 -1
checkpoint-55/scheduler.pt +1 -1
checkpoint-55/trainer_state.json +18 -27
checkpoint-55/training_args.bin +1 -1
checkpoint-74/model.safetensors +1 -1
checkpoint-74/optimizer.pt +1 -1
checkpoint-74/scheduler.pt +1 -1
checkpoint-74/trainer_state.json +23 -35
checkpoint-74/training_args.bin +1 -1
checkpoint-93/model.safetensors +1 -1
checkpoint-93/optimizer.pt +1 -1
checkpoint-93/scheduler.pt +1 -1
checkpoint-93/trainer_state.json +29 -44
checkpoint-93/training_args.bin +1 -1
model.safetensors +1 -1
runs/Sep02_21-37-15_ubumarcos/events.out.tfevents.1725305838.ubumarcos +3 -0
runs/Sep02_23-16-29_ubumarcos/events.out.tfevents.1725311792.ubumarcos +3 -0
runs/Sep02_23-18-00_ubumarcos/events.out.tfevents.1725311883.ubumarcos +3 -0
runs/Sep03_00-14-06_ubumarcos/events.out.tfevents.1725315248.ubumarcos +3 -0
runs/Sep03_13-16-32_ubumarcos/events.out.tfevents.1725362195.ubumarcos +3 -0
runs/Sep03_13-18-34_ubumarcos/events.out.tfevents.1725362316.ubumarcos +3 -0

README.md CHANGED

@@ -8,9 +8,6 @@ datasets:
 - audiofolder
 metrics:
 - accuracy
-- f1
-- precision
-- recall
 model-index:
 - name: distilhubert-finetuned-mixed-data
   results:
@@ -26,16 +23,7 @@ model-index:
     metrics:
     - name: Accuracy
       type: accuracy
-      value: 0.9026845637583892
-    - name: F1
-      type: f1
-      value: 0.9017814679012008
-    - name: Precision
-      type: precision
-      value: 0.901095676384633
-    - name: Recall
-      type: recall
-      value: 0.9026845637583892
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -45,11 +33,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [ntu-spml/distilhubert](https://huggingface.co/ntu-spml/distilhubert) on the audiofolder dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.2976
-- Accuracy: 0.9027
-- F1: 0.9018
-- Precision: 0.9011
-- Recall: 0.9027
 ## Model description
@@ -77,24 +62,15 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_ratio: 0.001
-- num_epochs: 12
 ### Training results
-| Training Loss | Epoch   | Step | Validation Loss | Accuracy | F1     | Precision | Recall |
-|:-------------:|:-------:|:----:|:---------------:|:--------:|:------:|:---------:|:------:|
-| No log        | 0.9664  | 18   | 0.6696          | 0.7819   | 0.7264 | 0.6898    | 0.7819 |
-| No log        | 1.9866  | 37   | 0.5068          | 0.7752   | 0.7203 | 0.6849    | 0.7752 |
-| No log        | 2.9530  | 55   | 0.4304          | 0.8087   | 0.7535 | 0.7242    | 0.8087 |
-| No log        | 3.9732  | 74   | 0.4109          | 0.8523   | 0.8434 | 0.8728    | 0.8523 |
-| No log        | 4.9933  | 93   | 0.3263          | 0.8725   | 0.8718 | 0.8719    | 0.8725 |
-| No log        | 5.9597  | 111  | 0.3036          | 0.8826   | 0.8824 | 0.8824    | 0.8826 |
-| No log        | 6.9799  | 130  | 0.3046          | 0.8893   | 0.8876 | 0.8892    | 0.8893 |
-| No log        | 8.0     | 149  | 0.3244          | 0.8758   | 0.8770 | 0.8787    | 0.8758 |
-| No log        | 8.9664  | 167  | 0.2962          | 0.9027   | 0.9018 | 0.9012    | 0.9027 |
-| No log        | 9.9866  | 186  | 0.2971          | 0.9027   | 0.9010 | 0.9014    | 0.9027 |
-| No log        | 10.9530 | 204  | 0.2974          | 0.9094   | 0.9082 | 0.9077    | 0.9094 |
-| No log        | 11.5973 | 216  | 0.2976          | 0.9027   | 0.9018 | 0.9011    | 0.9027 |
 ### Framework versions

 - audiofolder
 metrics:
 - accuracy
 model-index:
 - name: distilhubert-finetuned-mixed-data
   results:
     metrics:
     - name: Accuracy
       type: accuracy
+      value: 0.7919463087248322
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 This model is a fine-tuned version of [ntu-spml/distilhubert](https://huggingface.co/ntu-spml/distilhubert) on the audiofolder dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.4952
+- Accuracy: 0.7919
 ## Model description
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_ratio: 0.001
+- num_epochs: 3
 ### Training results
+| Training Loss | Epoch  | Step | Validation Loss | Accuracy |
+|:-------------:|:------:|:----:|:---------------:|:--------:|
+| No log        | 0.9664 | 18   | 0.7078          | 0.7584   |
+| No log        | 1.9866 | 37   | 0.5109          | 0.7852   |
+| No log        | 2.8993 | 54   | 0.4952          | 0.7919   |
 ### Framework versions
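
Reviewer note: the fractional epochs and step counts in the README tables follow a regular pattern. The sketch below reproduces them, assuming 149 micro-batches per epoch and 8 gradient-accumulation steps. Both figures are inferred from the logged numbers and are not stated anywhere in this commit; the helper names are hypothetical.

```python
import math

# Assumptions inferred from the logged numbers, not stated in the commit:
BATCHES_PER_EPOCH = 149  # micro-batches (train_batch_size=8) per epoch
GRAD_ACCUM_STEPS = 8     # micro-batches consumed per optimizer step


def steps_per_epoch() -> int:
    """Optimizer steps per epoch; the leftover micro-batches are dropped."""
    return max(BATCHES_PER_EPOCH // GRAD_ACCUM_STEPS, 1)


def max_steps(num_epochs: int) -> int:
    """Total optimizer steps for a run of num_epochs epochs."""
    return math.ceil(num_epochs * steps_per_epoch())


def epoch_at(step: int) -> float:
    """Fraction of the data seen after `step` optimizer steps, in epochs."""
    return step * GRAD_ACCUM_STEPS / BATCHES_PER_EPOCH


print(max_steps(3), max_steps(7), max_steps(12))
print(round(epoch_at(18), 4), round(epoch_at(54), 4))
```

Under these assumptions the three `num_epochs` settings seen in this diff (3, 7, and 12) give the `max_steps` values 54, 126, and 216 that appear in the `trainer_state.json` files.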

checkpoint-111/model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:8afb68ff2611e3603dee528e572e6fc36c40e47cec34c6ee683636922c8055e1
 size 94765560

 version https://git-lfs.github.com/spec/v1
+oid sha256:d6cfaf117c65cf6ad96ea90019d829932297275d9e389ebcc4ceeb3e2d099f17
 size 94765560
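
Reviewer note: the binary files in this commit are stored as Git LFS pointer files, so each diff shows only a spec version, a SHA-256 object id, and a byte size. When the size is unchanged and only the `oid` differs, as here, the file contents changed but its length did not. A minimal sketch of reading such a pointer, assuming the three-line `key value` layout shown above (`parse_lfs_pointer` is a hypothetical helper, not part of any library):

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse a Git LFS pointer file into its version, hash, and size."""
    fields = {}
    for line in text.strip().splitlines():
        # Each line is "<key> <value>"; split on the first space only.
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    # The oid has the form "<algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


# New-side pointer for checkpoint-111/model.safetensors, copied from this diff.
pointer = """version https://git-lfs.github.com/spec/v1
oid sha256:d6cfaf117c65cf6ad96ea90019d829932297275d9e389ebcc4ceeb3e2d099f17
size 94765560
"""

info = parse_lfs_pointer(pointer)
print(info["hash_algo"], info["size"])
```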

checkpoint-111/optimizer.pt CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:105e2897a04f3a74cea8691093cd964cb3e2476cb37888e94bd97ac020d7bc23
 size 189556666

 version https://git-lfs.github.com/spec/v1
+oid sha256:50c80d46c6ed45a1190e000a005d9359cffdc2f8b2d211aab96d7fddd708c357
 size 189556666

checkpoint-111/scheduler.pt CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:fd5bf82e806804b25d214305b99f2c178bc6f19a61f077621eaca5b3cb5523cd
 size 1064

 version https://git-lfs.github.com/spec/v1
+oid sha256:e2dc9ec36b920afe942421e68f88dd5c12762bff976ab7c419a2e088a9640109
 size 1064

checkpoint-111/trainer_state.json CHANGED

@@ -1,5 +1,5 @@
 {
-  "best_metric": 0.8825503355704698,
   "best_model_checkpoint": "distilhubert-finetuned-mixed-data/checkpoint-111",
   "epoch": 5.959731543624161,
   "eval_steps": 500,
@@ -10,81 +10,63 @@
   "log_history": [
     {
       "epoch": 0.9664429530201343,
-      "eval_accuracy": 0.7818791946308725,
-      "eval_f1": 0.7264205130236912,
-      "eval_loss": 0.669560968875885,
-      "eval_precision": 0.689807639599501,
-      "eval_recall": 0.7818791946308725,
-      "eval_runtime": 0.9033,
-      "eval_samples_per_second": 329.896,
-      "eval_steps_per_second": 42.067,
       "step": 18
     },
     {
       "epoch": 1.9865771812080537,
-      "eval_accuracy": 0.7751677852348994,
-      "eval_f1": 0.7202681570933687,
-      "eval_loss": 0.5067932605743408,
-      "eval_precision": 0.684911313518696,
-      "eval_recall": 0.7751677852348994,
-      "eval_runtime": 0.907,
-      "eval_samples_per_second": 328.546,
-      "eval_steps_per_second": 41.895,
       "step": 37
     },
     {
       "epoch": 2.953020134228188,
-      "eval_accuracy": 0.8087248322147651,
-      "eval_f1": 0.7535236037076262,
-      "eval_loss": 0.43038079142570496,
-      "eval_precision": 0.7241626365959,
-      "eval_recall": 0.8087248322147651,
-      "eval_runtime": 0.8664,
-      "eval_samples_per_second": 343.963,
-      "eval_steps_per_second": 43.861,
       "step": 55
     },
     {
       "epoch": 3.9731543624161074,
-      "eval_accuracy": 0.8523489932885906,
-      "eval_f1": 0.8433916249277822,
-      "eval_loss": 0.4109182059764862,
-      "eval_precision": 0.8727817866814688,
-      "eval_recall": 0.8523489932885906,
-      "eval_runtime": 0.8712,
-      "eval_samples_per_second": 342.059,
-      "eval_steps_per_second": 43.618,
       "step": 74
     },
     {
       "epoch": 4.993288590604027,
-      "eval_accuracy": 0.87248322147651,
-      "eval_f1": 0.8717711524765707,
-      "eval_loss": 0.3263051509857178,
-      "eval_precision": 0.8718521382399975,
-      "eval_recall": 0.87248322147651,
-      "eval_runtime": 0.87,
-      "eval_samples_per_second": 342.548,
-      "eval_steps_per_second": 43.681,
       "step": 93
     },
     {
       "epoch": 5.959731543624161,
-      "eval_accuracy": 0.8825503355704698,
-      "eval_f1": 0.8824400125399595,
-      "eval_loss": 0.3035907447338104,
-      "eval_precision": 0.8824270850226767,
-      "eval_recall": 0.8825503355704698,
-      "eval_runtime": 0.8921,
-      "eval_samples_per_second": 334.055,
-      "eval_steps_per_second": 42.598,
       "step": 111
     }
   ],
   "logging_steps": 500,
-  "max_steps": 216,
   "num_input_tokens_seen": 0,
-  "num_train_epochs": 12,
   "save_steps": 500,
   "stateful_callbacks": {
     "EarlyStoppingCallback": {

 {
+  "best_metric": 0.8657718120805369,
   "best_model_checkpoint": "distilhubert-finetuned-mixed-data/checkpoint-111",
   "epoch": 5.959731543624161,
   "eval_steps": 500,
   "log_history": [
     {
       "epoch": 0.9664429530201343,
+      "eval_accuracy": 0.7583892617449665,
+      "eval_loss": 0.686046838760376,
+      "eval_runtime": 3.2719,
+      "eval_samples_per_second": 91.079,
+      "eval_steps_per_second": 11.614,
       "step": 18
     },
     {
       "epoch": 1.9865771812080537,
+      "eval_accuracy": 0.802013422818792,
+      "eval_loss": 0.46226799488067627,
+      "eval_runtime": 3.3286,
+      "eval_samples_per_second": 89.527,
+      "eval_steps_per_second": 11.416,
       "step": 37
     },
     {
       "epoch": 2.953020134228188,
+      "eval_accuracy": 0.8187919463087249,
+      "eval_loss": 0.4068666100502014,
+      "eval_runtime": 3.2087,
+      "eval_samples_per_second": 92.871,
+      "eval_steps_per_second": 11.843,
       "step": 55
     },
     {
       "epoch": 3.9731543624161074,
+      "eval_accuracy": 0.8355704697986577,
+      "eval_loss": 0.3811332583427429,
+      "eval_runtime": 3.2325,
+      "eval_samples_per_second": 92.188,
+      "eval_steps_per_second": 11.755,
       "step": 74
     },
     {
       "epoch": 4.993288590604027,
+      "eval_accuracy": 0.8355704697986577,
+      "eval_loss": 0.3542439937591553,
+      "eval_runtime": 3.2746,
+      "eval_samples_per_second": 91.003,
+      "eval_steps_per_second": 11.604,
       "step": 93
     },
     {
       "epoch": 5.959731543624161,
+      "eval_accuracy": 0.8657718120805369,
+      "eval_loss": 0.33795884251594543,
+      "eval_runtime": 3.2548,
+      "eval_samples_per_second": 91.556,
+      "eval_steps_per_second": 11.675,
       "step": 111
     }
   ],
   "logging_steps": 500,
+  "max_steps": 126,
   "num_input_tokens_seen": 0,
+  "num_train_epochs": 7,
   "save_steps": 500,
   "stateful_callbacks": {
     "EarlyStoppingCallback": {

checkpoint-111/training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:296be9afae72ab3934d873f0cf92f87ef76899c18b11651de670afb49aa1a5d6
 size 5240

 version https://git-lfs.github.com/spec/v1
+oid sha256:4320ed7eb3857f3356f3c0fd71b66d450b29bc6f61001ac820f978865e977454
 size 5240

checkpoint-126/config.json ADDED

	@@ -0,0 +1,84 @@

+{
+  "_name_or_path": "ntu-spml/distilhubert",
+  "activation_dropout": 0.1,
+  "apply_spec_augment": false,
+  "architectures": [
+    "HubertForSequenceClassification"
+  ],
+  "attention_dropout": 0.1,
+  "bos_token_id": 1,
+  "classifier_proj_size": 256,
+  "conv_bias": false,
+  "conv_dim": [
+    512,
+    512,
+    512,
+    512,
+    512,
+    512,
+    512
+  ],
+  "conv_kernel": [
+    10,
+    3,
+    3,
+    3,
+    3,
+    2,
+    2
+  ],
+  "conv_stride": [
+    5,
+    2,
+    2,
+    2,
+    2,
+    2,
+    2
+  ],
+  "ctc_loss_reduction": "sum",
+  "ctc_zero_infinity": false,
+  "do_stable_layer_norm": false,
+  "eos_token_id": 2,
+  "feat_extract_activation": "gelu",
+  "feat_extract_norm": "group",
+  "feat_proj_dropout": 0.0,
+  "feat_proj_layer_norm": false,
+  "final_dropout": 0.0,
+  "hidden_act": "gelu",
+  "hidden_dropout": 0.1,
+  "hidden_size": 768,
+  "id2label": {
+    "0": "1s_asphyxia",
+    "1": "1s_hunger",
+    "2": "1s_normal",
+    "3": "1s_pain"
+  },
+  "initializer_range": 0.02,
+  "intermediate_size": 3072,
+  "label2id": {
+    "1s_asphyxia": "0",
+    "1s_hunger": "1",
+    "1s_normal": "2",
+    "1s_pain": "3"
+  },
+  "layer_norm_eps": 1e-05,
+  "layerdrop": 0.0,
+  "mask_feature_length": 10,
+  "mask_feature_min_masks": 0,
+  "mask_feature_prob": 0.0,
+  "mask_time_length": 10,
+  "mask_time_min_masks": 2,
+  "mask_time_prob": 0.05,
+  "model_type": "hubert",
+  "num_attention_heads": 12,
+  "num_conv_pos_embedding_groups": 16,
+  "num_conv_pos_embeddings": 128,
+  "num_feat_extract_layers": 7,
+  "num_hidden_layers": 2,
+  "pad_token_id": 0,
+  "torch_dtype": "float32",
+  "transformers_version": "4.44.2",
+  "use_weighted_layer_sum": false,
+  "vocab_size": 32
+}

checkpoint-126/model.safetensors ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:4510ed7e13f4bc4dd001eb3a533578bbc919b6f595428ad96456f352af1cc8e6
+size 94765560

checkpoint-126/optimizer.pt ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:a5d9819ed97d940763643a3cfd6dc3ffac7621d0614842f5a7e5119aa1adc61d
+size 189556666

checkpoint-126/rng_state.pth ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:2756daa1d15b38a73c17f51c8dd3dc3188afaca774220382259e764150eca057
+size 14308

checkpoint-126/scheduler.pt ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:1e1524b5fb3e7ae768641537cfd8745bb359ffd8b97cac004ef74c63f2b5b06c
+size 1064

checkpoint-126/trainer_state.json ADDED

	@@ -0,0 +1,105 @@

+{
+  "best_metric": 0.8691275167785235,
+  "best_model_checkpoint": "distilhubert-finetuned-mixed-data/checkpoint-126",
+  "epoch": 6.76510067114094,
+  "eval_steps": 500,
+  "global_step": 126,
+  "is_hyper_param_search": false,
+  "is_local_process_zero": true,
+  "is_world_process_zero": true,
+  "log_history": [
+    {
+      "epoch": 0.9664429530201343,
+      "eval_accuracy": 0.7583892617449665,
+      "eval_loss": 0.686046838760376,
+      "eval_runtime": 3.2719,
+      "eval_samples_per_second": 91.079,
+      "eval_steps_per_second": 11.614,
+      "step": 18
+    },
+    {
+      "epoch": 1.9865771812080537,
+      "eval_accuracy": 0.802013422818792,
+      "eval_loss": 0.46226799488067627,
+      "eval_runtime": 3.3286,
+      "eval_samples_per_second": 89.527,
+      "eval_steps_per_second": 11.416,
+      "step": 37
+    },
+    {
+      "epoch": 2.953020134228188,
+      "eval_accuracy": 0.8187919463087249,
+      "eval_loss": 0.4068666100502014,
+      "eval_runtime": 3.2087,
+      "eval_samples_per_second": 92.871,
+      "eval_steps_per_second": 11.843,
+      "step": 55
+    },
+    {
+      "epoch": 3.9731543624161074,
+      "eval_accuracy": 0.8355704697986577,
+      "eval_loss": 0.3811332583427429,
+      "eval_runtime": 3.2325,
+      "eval_samples_per_second": 92.188,
+      "eval_steps_per_second": 11.755,
+      "step": 74
+    },
+    {
+      "epoch": 4.993288590604027,
+      "eval_accuracy": 0.8355704697986577,
+      "eval_loss": 0.3542439937591553,
+      "eval_runtime": 3.2746,
+      "eval_samples_per_second": 91.003,
+      "eval_steps_per_second": 11.604,
+      "step": 93
+    },
+    {
+      "epoch": 5.959731543624161,
+      "eval_accuracy": 0.8657718120805369,
+      "eval_loss": 0.33795884251594543,
+      "eval_runtime": 3.2548,
+      "eval_samples_per_second": 91.556,
+      "eval_steps_per_second": 11.675,
+      "step": 111
+    },
+    {
+      "epoch": 6.76510067114094,
+      "eval_accuracy": 0.8691275167785235,
+      "eval_loss": 0.33603718876838684,
+      "eval_runtime": 3.1993,
+      "eval_samples_per_second": 93.145,
+      "eval_steps_per_second": 11.877,
+      "step": 126
+    }
+  ],
+  "logging_steps": 500,
+  "max_steps": 126,
+  "num_input_tokens_seen": 0,
+  "num_train_epochs": 7,
+  "save_steps": 500,
+  "stateful_callbacks": {
+    "EarlyStoppingCallback": {
+      "args": {
+        "early_stopping_patience": 3,
+        "early_stopping_threshold": 0.0
+      },
+      "attributes": {
+        "early_stopping_patience_counter": 0
+      }
+    },
+    "TrainerControl": {
+      "args": {
+        "should_epoch_stop": false,
+        "should_evaluate": false,
+        "should_log": false,
+        "should_save": true,
+        "should_training_stop": true
+      },
+      "attributes": {}
+    }
+  },
+  "total_flos": 1.831207226112e+16,
+  "train_batch_size": 8,
+  "trial_name": null,
+  "trial_params": null
+}
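
Reviewer note: `best_metric` in this file should match the highest `eval_accuracy` in `log_history`. The sketch below checks that on an excerpt of the values above. It assumes accuracy is the metric used for best-model selection, which the matching values suggest but the diff does not state; `best_eval` is a hypothetical helper.

```python
import json

# Excerpt of checkpoint-126/trainer_state.json from this diff, trimmed to the
# fields used here; in practice you would json.load() the file itself.
STATE = json.loads("""
{
  "best_metric": 0.8691275167785235,
  "log_history": [
    {"step": 18,  "eval_accuracy": 0.7583892617449665, "eval_loss": 0.686046838760376},
    {"step": 37,  "eval_accuracy": 0.802013422818792,  "eval_loss": 0.46226799488067627},
    {"step": 55,  "eval_accuracy": 0.8187919463087249, "eval_loss": 0.4068666100502014},
    {"step": 74,  "eval_accuracy": 0.8355704697986577, "eval_loss": 0.3811332583427429},
    {"step": 93,  "eval_accuracy": 0.8355704697986577, "eval_loss": 0.3542439937591553},
    {"step": 111, "eval_accuracy": 0.8657718120805369, "eval_loss": 0.33795884251594543},
    {"step": 126, "eval_accuracy": 0.8691275167785235, "eval_loss": 0.33603718876838684}
  ]
}
""")


def best_eval(log_history, metric="eval_accuracy"):
    """Return the log entry with the highest value of `metric`."""
    # Skip entries that do not carry the metric (e.g. training-loss logs).
    evals = [entry for entry in log_history if metric in entry]
    return max(evals, key=lambda entry: entry[metric])


best = best_eval(STATE["log_history"])
print(best["step"], best["eval_accuracy"])
```

Here the best entry is the final evaluation at step 126, which agrees with `best_model_checkpoint` pointing at `checkpoint-126`.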

checkpoint-126/training_args.bin ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:4320ed7eb3857f3356f3c0fd71b66d450b29bc6f61001ac820f978865e977454
+size 5240

checkpoint-18/model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:64f51b8591199469762fae24f74ba430f094c3690d9c48c1cd603b5de70546cc
 size 94765560

 version https://git-lfs.github.com/spec/v1
+oid sha256:fedc5342e503764722ceaa1636b7c8498a2aa3bdaaef986214edb574f615af9b
 size 94765560

checkpoint-18/optimizer.pt CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:3e666f8131b7924c8e7c848969c0e2bd68f8876fd6d4f4be6390692b1d8660d2
 size 189556666

 version https://git-lfs.github.com/spec/v1
+oid sha256:cc2385515662f218746793346cf7c0642966c0a2fc816dba4bfca3f4f9045571
 size 189556666

checkpoint-18/scheduler.pt CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:5c3bfa84a5a5584f0c303cb2a5ebd02ca6effcf09b8f28364ebe555fedff3ef2
 size 1064

 version https://git-lfs.github.com/spec/v1
+oid sha256:79e952bae3d444fea9fb53c21720e163f1659ebc933422f10b6da73663ab7443
 size 1064

checkpoint-18/trainer_state.json CHANGED

@@ -1,5 +1,5 @@
 {
-  "best_metric": 0.7818791946308725,
   "best_model_checkpoint": "distilhubert-finetuned-mixed-data/checkpoint-18",
   "epoch": 0.9664429530201343,
   "eval_steps": 500,
@@ -10,21 +10,18 @@
   "log_history": [
     {
       "epoch": 0.9664429530201343,
-      "eval_accuracy": 0.7818791946308725,
-      "eval_f1": 0.7264205130236912,
-      "eval_loss": 0.669560968875885,
-      "eval_precision": 0.689807639599501,
-      "eval_recall": 0.7818791946308725,
-      "eval_runtime": 0.9033,
-      "eval_samples_per_second": 329.896,
-      "eval_steps_per_second": 42.067,
       "step": 18
     }
   ],
   "logging_steps": 500,
-  "max_steps": 216,
   "num_input_tokens_seen": 0,
-  "num_train_epochs": 12,
   "save_steps": 500,
   "stateful_callbacks": {
     "EarlyStoppingCallback": {

 {
+  "best_metric": 0.7583892617449665,
   "best_model_checkpoint": "distilhubert-finetuned-mixed-data/checkpoint-18",
   "epoch": 0.9664429530201343,
   "eval_steps": 500,
   "log_history": [
     {
       "epoch": 0.9664429530201343,
+      "eval_accuracy": 0.7583892617449665,
+      "eval_loss": 0.7078245878219604,
+      "eval_runtime": 3.2791,
+      "eval_samples_per_second": 90.879,
+      "eval_steps_per_second": 11.589,
       "step": 18
     }
   ],
   "logging_steps": 500,
+  "max_steps": 54,
   "num_input_tokens_seen": 0,
+  "num_train_epochs": 3,
   "save_steps": 500,
   "stateful_callbacks": {
     "EarlyStoppingCallback": {

checkpoint-18/training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:296be9afae72ab3934d873f0cf92f87ef76899c18b11651de670afb49aa1a5d6
 size 5240

 version https://git-lfs.github.com/spec/v1
+oid sha256:3b74ef34b0c98ff8e2f446712958b67f0e9dae2c892b2706cb653c6c4a3cba29
 size 5240

checkpoint-37/model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:bd1228d12cca62942f22b7cd2a53221799d4a8f6f0c65cecd82e15483991b3bc
 size 94765560

 version https://git-lfs.github.com/spec/v1
+oid sha256:0cb23af36fc9e2835fb254591859fcc2a29bed34d849c9632574cf4e143c2ffe
 size 94765560

checkpoint-37/optimizer.pt CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:770512ff23ef603be29f7cf1b3dd36bbdc3cd34ae0b9d588aa475cdae922d2c0
 size 189556666

 version https://git-lfs.github.com/spec/v1
+oid sha256:7b3ee13fed17092efc92242544dbde28f36e42511a859596375a636a90508461
 size 189556666

checkpoint-37/scheduler.pt CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:f744fe2b4201b36e575e55f1dd02cb6625ec800d3b29b1053597233d2f6239fa
 size 1064

 version https://git-lfs.github.com/spec/v1
+oid sha256:d61595ea9a3653f16ffab19b4744db7fac53b6e9dd3b6b622460e0cb5901f7cd
 size 1064

checkpoint-37/trainer_state.json CHANGED

@@ -1,6 +1,6 @@
 {
-  "best_metric": 0.7818791946308725,
-  "best_model_checkpoint": "distilhubert-finetuned-mixed-data/checkpoint-18",
   "epoch": 1.9865771812080537,
   "eval_steps": 500,
   "global_step": 37,
@@ -10,33 +10,27 @@
   "log_history": [
     {
       "epoch": 0.9664429530201343,
-      "eval_accuracy": 0.7818791946308725,
-      "eval_f1": 0.7264205130236912,
-      "eval_loss": 0.669560968875885,
-      "eval_precision": 0.689807639599501,
-      "eval_recall": 0.7818791946308725,
-      "eval_runtime": 0.9033,
-      "eval_samples_per_second": 329.896,
-      "eval_steps_per_second": 42.067,
       "step": 18
     },
     {
       "epoch": 1.9865771812080537,
-      "eval_accuracy": 0.7751677852348994,
-      "eval_f1": 0.7202681570933687,
-      "eval_loss": 0.5067932605743408,
-      "eval_precision": 0.684911313518696,
-      "eval_recall": 0.7751677852348994,
-      "eval_runtime": 0.907,
-      "eval_samples_per_second": 328.546,
-      "eval_steps_per_second": 41.895,
       "step": 37
     }
   ],
   "logging_steps": 500,
-  "max_steps": 216,
   "num_input_tokens_seen": 0,
-  "num_train_epochs": 12,
   "save_steps": 500,
   "stateful_callbacks": {
     "EarlyStoppingCallback": {

 {
+  "best_metric": 0.785234899328859,
+  "best_model_checkpoint": "distilhubert-finetuned-mixed-data/checkpoint-37",
   "epoch": 1.9865771812080537,
   "eval_steps": 500,
   "global_step": 37,
   "log_history": [
     {
       "epoch": 0.9664429530201343,
+      "eval_accuracy": 0.7583892617449665,
+      "eval_loss": 0.7078245878219604,
+      "eval_runtime": 3.2791,
+      "eval_samples_per_second": 90.879,
+      "eval_steps_per_second": 11.589,
       "step": 18
     },
     {
       "epoch": 1.9865771812080537,
+      "eval_accuracy": 0.785234899328859,
+      "eval_loss": 0.5109438300132751,
+      "eval_runtime": 3.2689,
+      "eval_samples_per_second": 91.162,
+      "eval_steps_per_second": 11.625,
       "step": 37
     }
   ],
   "logging_steps": 500,
+  "max_steps": 54,
   "num_input_tokens_seen": 0,
+  "num_train_epochs": 3,
   "save_steps": 500,
   "stateful_callbacks": {
     "EarlyStoppingCallback": {

checkpoint-37/training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:296be9afae72ab3934d873f0cf92f87ef76899c18b11651de670afb49aa1a5d6
 size 5240

 version https://git-lfs.github.com/spec/v1
+oid sha256:3b74ef34b0c98ff8e2f446712958b67f0e9dae2c892b2706cb653c6c4a3cba29
 size 5240

checkpoint-54/model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:ddc06b1a02837e21e95387003f1e736a75ada2cc311f92602719c1edbbc04f50
 size 94765560

 version https://git-lfs.github.com/spec/v1
+oid sha256:5aa09bf08a82037b1a7e97fc26fb24670bc804169b2866e90999fb465178164d
 size 94765560

checkpoint-54/optimizer.pt CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:21244447ac8c1fa88b7aeba15d59e7279b15722946d0143bcbee60e3d2bf3ce7
 size 189556666

 version https://git-lfs.github.com/spec/v1
+oid sha256:753d093b226b6f0025d58bd26e5f213b98a28689a979e9ea31621429593e3533
 size 189556666

checkpoint-54/rng_state.pth CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:9a2fce6e9deba3361ccb9abfc78de8b2f74d3006b3dab904337047451f90d0ba
-size 14244

 version https://git-lfs.github.com/spec/v1
+oid sha256:c9fc2ffa6937057fd69aff15425ea0616520fc92b279414eaf3b97409628bc19
+size 14308

checkpoint-54/trainer_state.json CHANGED

@@ -1,5 +1,5 @@
 {
-  "best_metric": 0.802013422818792,
   "best_model_checkpoint": "distilhubert-finetuned-mixed-data/checkpoint-54",
   "epoch": 2.899328859060403,
   "eval_steps": 500,
@@ -10,29 +10,29 @@
   "log_history": [
     {
       "epoch": 0.9664429530201343,
-      "eval_accuracy": 0.7684563758389261,
-      "eval_loss": 0.7515301704406738,
-      "eval_runtime": 0.7411,
-      "eval_samples_per_second": 402.118,
-      "eval_steps_per_second": 51.277,
       "step": 18
     },
     {
       "epoch": 1.9865771812080537,
-      "eval_accuracy": 0.7953020134228188,
-      "eval_loss": 0.5268774628639221,
-      "eval_runtime": 0.7559,
-      "eval_samples_per_second": 394.247,
-      "eval_steps_per_second": 50.273,
       "step": 37
     },
     {
       "epoch": 2.899328859060403,
-      "eval_accuracy": 0.802013422818792,
-      "eval_loss": 0.4906807839870453,
-      "eval_runtime": 0.739,
-      "eval_samples_per_second": 403.255,
-      "eval_steps_per_second": 51.422,
       "step": 54
     }
   ],
@@ -42,6 +42,15 @@
   "num_train_epochs": 3,
   "save_steps": 500,
   "stateful_callbacks": {
     "TrainerControl": {
       "args": {
         "should_epoch_stop": false,

 {
+  "best_metric": 0.7919463087248322,
   "best_model_checkpoint": "distilhubert-finetuned-mixed-data/checkpoint-54",
   "epoch": 2.899328859060403,
   "eval_steps": 500,
   "log_history": [
     {
       "epoch": 0.9664429530201343,
+      "eval_accuracy": 0.7583892617449665,
+      "eval_loss": 0.7078245878219604,
+      "eval_runtime": 3.2791,
+      "eval_samples_per_second": 90.879,
+      "eval_steps_per_second": 11.589,
       "step": 18
     },
     {
       "epoch": 1.9865771812080537,
+      "eval_accuracy": 0.785234899328859,
+      "eval_loss": 0.5109438300132751,
+      "eval_runtime": 3.2689,
+      "eval_samples_per_second": 91.162,
+      "eval_steps_per_second": 11.625,
       "step": 37
     },
     {
       "epoch": 2.899328859060403,
+      "eval_accuracy": 0.7919463087248322,
+      "eval_loss": 0.4952092468738556,
+      "eval_runtime": 3.2798,
+      "eval_samples_per_second": 90.86,
+      "eval_steps_per_second": 11.586,
       "step": 54
     }
   ],
   "num_train_epochs": 3,
   "save_steps": 500,
   "stateful_callbacks": {
+    "EarlyStoppingCallback": {
+      "args": {
+        "early_stopping_patience": 3,
+        "early_stopping_threshold": 0.0
+      },
+      "attributes": {
+        "early_stopping_patience_counter": 0
+      }
+    },
     "TrainerControl": {
       "args": {
         "should_epoch_stop": false,

checkpoint-54/training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:1c7f4e93a08117554edcac2e7ce68e97841c557e813806f1446c2ab82115baa2
 size 5240

 version https://git-lfs.github.com/spec/v1
+oid sha256:3b74ef34b0c98ff8e2f446712958b67f0e9dae2c892b2706cb653c6c4a3cba29
 size 5240

checkpoint-55/model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:c618f9432ed808a829c6b9322c2de70a050c8d68461263263a1ef25ba955522c
 size 94765560

 version https://git-lfs.github.com/spec/v1
+oid sha256:6a56d023e24e3488758e046ad64c3d057691e3e8253a5b6f0139dfed87b15032
 size 94765560

checkpoint-55/optimizer.pt CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:eda417e68dbd70d3239f0ee68d35d7c4c4ee28b662a007739b7af7e3358a7b8b
 size 189556666

 version https://git-lfs.github.com/spec/v1
+oid sha256:6832cd2547bddccb9a7ffc2cd19f963924a3e7ae37e344b0759b7f9310c16429
 size 189556666

checkpoint-55/scheduler.pt CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:b619a76e273bca0b5b749883b1e3347edb727e6888b4090828a33e2f2c1f4fae
 size 1064

 version https://git-lfs.github.com/spec/v1
+oid sha256:d4af5c5db34db2bd723c5a003a705e44ba7bbc4acb13f2908560574feaabe15f
 size 1064

checkpoint-55/trainer_state.json CHANGED

@@ -1,5 +1,5 @@
 {
-  "best_metric": 0.8087248322147651,
   "best_model_checkpoint": "distilhubert-finetuned-mixed-data/checkpoint-55",
   "epoch": 2.953020134228188,
   "eval_steps": 500,
@@ -10,45 +10,36 @@
   "log_history": [
     {
       "epoch": 0.9664429530201343,
-      "eval_accuracy": 0.7818791946308725,
-      "eval_f1": 0.7264205130236912,
-      "eval_loss": 0.669560968875885,
-      "eval_precision": 0.689807639599501,
-      "eval_recall": 0.7818791946308725,
-      "eval_runtime": 0.9033,
-      "eval_samples_per_second": 329.896,
-      "eval_steps_per_second": 42.067,
       "step": 18
     },
     {
       "epoch": 1.9865771812080537,
-      "eval_accuracy": 0.7751677852348994,
-      "eval_f1": 0.7202681570933687,
-      "eval_loss": 0.5067932605743408,
-      "eval_precision": 0.684911313518696,
-      "eval_recall": 0.7751677852348994,
-      "eval_runtime": 0.907,
-      "eval_samples_per_second": 328.546,
-      "eval_steps_per_second": 41.895,
       "step": 37
     },
     {
       "epoch": 2.953020134228188,
-      "eval_accuracy": 0.8087248322147651,
-      "eval_f1": 0.7535236037076262,
-      "eval_loss": 0.43038079142570496,
-      "eval_precision": 0.7241626365959,
-      "eval_recall": 0.8087248322147651,
-      "eval_runtime": 0.8664,
-      "eval_samples_per_second": 343.963,
-      "eval_steps_per_second": 43.861,
       "step": 55
     }
   ],
   "logging_steps": 500,
-  "max_steps": 216,
   "num_input_tokens_seen": 0,
-  "num_train_epochs": 12,
   "save_steps": 500,
   "stateful_callbacks": {
     "EarlyStoppingCallback": {

 {
+  "best_metric": 0.8187919463087249,
   "best_model_checkpoint": "distilhubert-finetuned-mixed-data/checkpoint-55",
   "epoch": 2.953020134228188,
   "eval_steps": 500,
   "log_history": [
     {
       "epoch": 0.9664429530201343,
+      "eval_accuracy": 0.7583892617449665,
+      "eval_loss": 0.686046838760376,
+      "eval_runtime": 3.2719,
+      "eval_samples_per_second": 91.079,
+      "eval_steps_per_second": 11.614,
       "step": 18
     },
     {
       "epoch": 1.9865771812080537,
+      "eval_accuracy": 0.802013422818792,
+      "eval_loss": 0.46226799488067627,
+      "eval_runtime": 3.3286,
+      "eval_samples_per_second": 89.527,
+      "eval_steps_per_second": 11.416,
       "step": 37
     },
     {
       "epoch": 2.953020134228188,
+      "eval_accuracy": 0.8187919463087249,
+      "eval_loss": 0.4068666100502014,
+      "eval_runtime": 3.2087,
+      "eval_samples_per_second": 92.871,
+      "eval_steps_per_second": 11.843,
       "step": 55
     }
   ],
   "logging_steps": 500,
+  "max_steps": 126,
   "num_input_tokens_seen": 0,
+  "num_train_epochs": 7,
   "save_steps": 500,
   "stateful_callbacks": {
     "EarlyStoppingCallback": {

checkpoint-55/training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:296be9afae72ab3934d873f0cf92f87ef76899c18b11651de670afb49aa1a5d6
 size 5240

 version https://git-lfs.github.com/spec/v1
+oid sha256:4320ed7eb3857f3356f3c0fd71b66d450b29bc6f61001ac820f978865e977454
 size 5240

checkpoint-74/model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:a65df2df2e438591f2a6638db08aa114327846410ac2c5aa11c45b4447cc6349
 size 94765560

 version https://git-lfs.github.com/spec/v1
+oid sha256:0a5d3bc1abf730eda54b1adbdc9aa43335bea7850d95611eeed903f77b34a044
 size 94765560

checkpoint-74/optimizer.pt CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:a56095d9e4fb10f1821c0613eb1e41b10331d6c0ba1a8e9c13f26b97709b275c
 size 189556666

 version https://git-lfs.github.com/spec/v1
+oid sha256:2ae95457eed519491999c6039260cf9d6720ed842d5964a72039871a9c30af7e
 size 189556666

checkpoint-74/scheduler.pt CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:33230a8f5e96901b9c18a66475024a2d6a7717cc358ba599df0c422ae9370052
 size 1064

 version https://git-lfs.github.com/spec/v1
+oid sha256:e394f0e7c3dcb107b5480b00977436132ee632367118f6b993703acf51b4f795
 size 1064

checkpoint-74/trainer_state.json CHANGED

@@ -1,5 +1,5 @@
 {
-  "best_metric": 0.8523489932885906,
   "best_model_checkpoint": "distilhubert-finetuned-mixed-data/checkpoint-74",
   "epoch": 3.9731543624161074,
   "eval_steps": 500,
@@ -10,57 +10,45 @@
   "log_history": [
     {
       "epoch": 0.9664429530201343,
-      "eval_accuracy": 0.7818791946308725,
-      "eval_f1": 0.7264205130236912,
-      "eval_loss": 0.669560968875885,
-      "eval_precision": 0.689807639599501,
-      "eval_recall": 0.7818791946308725,
-      "eval_runtime": 0.9033,
-      "eval_samples_per_second": 329.896,
-      "eval_steps_per_second": 42.067,
       "step": 18
     },
     {
       "epoch": 1.9865771812080537,
-      "eval_accuracy": 0.7751677852348994,
-      "eval_f1": 0.7202681570933687,
-      "eval_loss": 0.5067932605743408,
-      "eval_precision": 0.684911313518696,
-      "eval_recall": 0.7751677852348994,
-      "eval_runtime": 0.907,
-      "eval_samples_per_second": 328.546,
-      "eval_steps_per_second": 41.895,
       "step": 37
     },
     {
       "epoch": 2.953020134228188,
-      "eval_accuracy": 0.8087248322147651,
-      "eval_f1": 0.7535236037076262,
-      "eval_loss": 0.43038079142570496,
-      "eval_precision": 0.7241626365959,
-      "eval_recall": 0.8087248322147651,
-      "eval_runtime": 0.8664,
-      "eval_samples_per_second": 343.963,
-      "eval_steps_per_second": 43.861,
       "step": 55
     },
     {
       "epoch": 3.9731543624161074,
-      "eval_accuracy": 0.8523489932885906,
-      "eval_f1": 0.8433916249277822,
-      "eval_loss": 0.4109182059764862,
-      "eval_precision": 0.8727817866814688,
-      "eval_recall": 0.8523489932885906,
-      "eval_runtime": 0.8712,
-      "eval_samples_per_second": 342.059,
-      "eval_steps_per_second": 43.618,
       "step": 74
     }
   ],
   "logging_steps": 500,
-  "max_steps": 216,
   "num_input_tokens_seen": 0,
-  "num_train_epochs": 12,
   "save_steps": 500,
   "stateful_callbacks": {
     "EarlyStoppingCallback": {

 {
+  "best_metric": 0.8355704697986577,
   "best_model_checkpoint": "distilhubert-finetuned-mixed-data/checkpoint-74",
   "epoch": 3.9731543624161074,
   "eval_steps": 500,
   "log_history": [
     {
       "epoch": 0.9664429530201343,
+      "eval_accuracy": 0.7583892617449665,
+      "eval_loss": 0.686046838760376,
+      "eval_runtime": 3.2719,
+      "eval_samples_per_second": 91.079,
+      "eval_steps_per_second": 11.614,
       "step": 18
     },
     {
       "epoch": 1.9865771812080537,
+      "eval_accuracy": 0.802013422818792,
+      "eval_loss": 0.46226799488067627,
+      "eval_runtime": 3.3286,
+      "eval_samples_per_second": 89.527,
+      "eval_steps_per_second": 11.416,
       "step": 37
     },
     {
       "epoch": 2.953020134228188,
+      "eval_accuracy": 0.8187919463087249,
+      "eval_loss": 0.4068666100502014,
+      "eval_runtime": 3.2087,
+      "eval_samples_per_second": 92.871,
+      "eval_steps_per_second": 11.843,
       "step": 55
     },
     {
       "epoch": 3.9731543624161074,
+      "eval_accuracy": 0.8355704697986577,
+      "eval_loss": 0.3811332583427429,
+      "eval_runtime": 3.2325,
+      "eval_samples_per_second": 92.188,
+      "eval_steps_per_second": 11.755,
       "step": 74
     }
   ],
   "logging_steps": 500,
+  "max_steps": 126,
   "num_input_tokens_seen": 0,
+  "num_train_epochs": 7,
   "save_steps": 500,
   "stateful_callbacks": {
     "EarlyStoppingCallback": {

checkpoint-74/training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:296be9afae72ab3934d873f0cf92f87ef76899c18b11651de670afb49aa1a5d6
 size 5240

 version https://git-lfs.github.com/spec/v1
+oid sha256:4320ed7eb3857f3356f3c0fd71b66d450b29bc6f61001ac820f978865e977454
 size 5240

checkpoint-93/model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:0b1278f478e484fe8966052e7cedbe2cd627eca9979ce11b06e5b8f823427f29
 size 94765560

 version https://git-lfs.github.com/spec/v1
+oid sha256:0d3d69c192989ce2813f8d99d6435c07783c8a6f57b3311dc42e47e0b875e811
 size 94765560

checkpoint-93/optimizer.pt CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:eb3c381477600c922a9b63900e0bc0281efab5e1d3889d5971688dd7b8631338
 size 189556666

 version https://git-lfs.github.com/spec/v1
+oid sha256:a1ce9c56f3a961f219bca2e4bb08198e6845fd46ebfc30a3de72b3247c77dcea
 size 189556666

checkpoint-93/scheduler.pt CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:b5a98f38af6530b97608c99604027e7e1fd08ea62f92b5d26f4379a4874e5a6e
 size 1064

 version https://git-lfs.github.com/spec/v1
+oid sha256:c2bb70a20d69c42e89da01454789086991cde012c396cfc3a8588724e4a08637
 size 1064

checkpoint-93/trainer_state.json CHANGED

@@ -1,6 +1,6 @@
 {
-  "best_metric": 0.87248322147651,
-  "best_model_checkpoint": "distilhubert-finetuned-mixed-data/checkpoint-93",
   "epoch": 4.993288590604027,
   "eval_steps": 500,
   "global_step": 93,
@@ -10,69 +10,54 @@
   "log_history": [
     {
       "epoch": 0.9664429530201343,
-      "eval_accuracy": 0.7818791946308725,
-      "eval_f1": 0.7264205130236912,
-      "eval_loss": 0.669560968875885,
-      "eval_precision": 0.689807639599501,
-      "eval_recall": 0.7818791946308725,
-      "eval_runtime": 0.9033,
-      "eval_samples_per_second": 329.896,
-      "eval_steps_per_second": 42.067,
       "step": 18
     },
     {
       "epoch": 1.9865771812080537,
-      "eval_accuracy": 0.7751677852348994,
-      "eval_f1": 0.7202681570933687,
-      "eval_loss": 0.5067932605743408,
-      "eval_precision": 0.684911313518696,
-      "eval_recall": 0.7751677852348994,
-      "eval_runtime": 0.907,
-      "eval_samples_per_second": 328.546,
-      "eval_steps_per_second": 41.895,
       "step": 37
     },
     {
       "epoch": 2.953020134228188,
-      "eval_accuracy": 0.8087248322147651,
-      "eval_f1": 0.7535236037076262,
-      "eval_loss": 0.43038079142570496,
-      "eval_precision": 0.7241626365959,
-      "eval_recall": 0.8087248322147651,
-      "eval_runtime": 0.8664,
-      "eval_samples_per_second": 343.963,
-      "eval_steps_per_second": 43.861,
       "step": 55
     },
     {
       "epoch": 3.9731543624161074,
-      "eval_accuracy": 0.8523489932885906,
-      "eval_f1": 0.8433916249277822,
-      "eval_loss": 0.4109182059764862,
-      "eval_precision": 0.8727817866814688,
-      "eval_recall": 0.8523489932885906,
-      "eval_runtime": 0.8712,
-      "eval_samples_per_second": 342.059,
-      "eval_steps_per_second": 43.618,
       "step": 74
     },
     {
       "epoch": 4.993288590604027,
-      "eval_accuracy": 0.87248322147651,
-      "eval_f1": 0.8717711524765707,
-      "eval_loss": 0.3263051509857178,
-      "eval_precision": 0.8718521382399975,
-      "eval_recall": 0.87248322147651,
-      "eval_runtime": 0.87,
-      "eval_samples_per_second": 342.548,
-      "eval_steps_per_second": 43.681,
       "step": 93
     }
   ],
   "logging_steps": 500,
-  "max_steps": 216,
   "num_input_tokens_seen": 0,
-  "num_train_epochs": 12,
   "save_steps": 500,
   "stateful_callbacks": {
     "EarlyStoppingCallback": {

 {
+  "best_metric": 0.8355704697986577,
+  "best_model_checkpoint": "distilhubert-finetuned-mixed-data/checkpoint-74",
   "epoch": 4.993288590604027,
   "eval_steps": 500,
   "global_step": 93,
   "log_history": [
     {
       "epoch": 0.9664429530201343,
+      "eval_accuracy": 0.7583892617449665,
+      "eval_loss": 0.686046838760376,
+      "eval_runtime": 3.2719,
+      "eval_samples_per_second": 91.079,
+      "eval_steps_per_second": 11.614,
       "step": 18
     },
     {
       "epoch": 1.9865771812080537,
+      "eval_accuracy": 0.802013422818792,
+      "eval_loss": 0.46226799488067627,
+      "eval_runtime": 3.3286,
+      "eval_samples_per_second": 89.527,
+      "eval_steps_per_second": 11.416,
       "step": 37
     },
     {
       "epoch": 2.953020134228188,
+      "eval_accuracy": 0.8187919463087249,
+      "eval_loss": 0.4068666100502014,
+      "eval_runtime": 3.2087,
+      "eval_samples_per_second": 92.871,
+      "eval_steps_per_second": 11.843,
       "step": 55
     },
     {
       "epoch": 3.9731543624161074,
+      "eval_accuracy": 0.8355704697986577,
+      "eval_loss": 0.3811332583427429,
+      "eval_runtime": 3.2325,
+      "eval_samples_per_second": 92.188,
+      "eval_steps_per_second": 11.755,
       "step": 74
     },
     {
       "epoch": 4.993288590604027,
+      "eval_accuracy": 0.8355704697986577,
+      "eval_loss": 0.3542439937591553,
+      "eval_runtime": 3.2746,
+      "eval_samples_per_second": 91.003,
+      "eval_steps_per_second": 11.604,
       "step": 93
     }
   ],
   "logging_steps": 500,
+  "max_steps": 126,
   "num_input_tokens_seen": 0,
+  "num_train_epochs": 7,
   "save_steps": 500,
   "stateful_callbacks": {
     "EarlyStoppingCallback": {

checkpoint-93/training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:296be9afae72ab3934d873f0cf92f87ef76899c18b11651de670afb49aa1a5d6
 size 5240

 version https://git-lfs.github.com/spec/v1
+oid sha256:4320ed7eb3857f3356f3c0fd71b66d450b29bc6f61001ac820f978865e977454
 size 5240

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:add11297401b595ef24d8a66b9cf4f5bca92dae882193b715a5917dff07be685
 size 94765560

 version https://git-lfs.github.com/spec/v1
+oid sha256:5aa09bf08a82037b1a7e97fc26fb24670bc804169b2866e90999fb465178164d
 size 94765560

runs/Sep02_21-37-15_ubumarcos/events.out.tfevents.1725305838.ubumarcos ADDED

@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:3bd2942fa11e2b96ec0bf193c02a3d90f545996ac5d6c9a2a1d79dc5b7e274c1
+size 6562

runs/Sep02_23-16-29_ubumarcos/events.out.tfevents.1725311792.ubumarcos ADDED

@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:ce158ecf28ec84ea36b857f71a855b6145bd29512438400faabeae605fb1f97d
+size 6562

runs/Sep02_23-18-00_ubumarcos/events.out.tfevents.1725311883.ubumarcos ADDED

@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:f455c26277a30f5cb10c5e6da88cd4c1e37afa9b14eb98e3175e7db787029f36
+size 8464

runs/Sep03_00-14-06_ubumarcos/events.out.tfevents.1725315248.ubumarcos ADDED

@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:31fa2415a5ff41201a78bc2d0fc4876051bc72fc9ea9d0a62bac3429895a1c37
+size 5897

runs/Sep03_13-16-32_ubumarcos/events.out.tfevents.1725362195.ubumarcos ADDED

@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:e860ae2b515ea6737e89cdf0074e7f5ae63319c4e78e1224a05d1342ac0a1d07
+size 5897

runs/Sep03_13-18-34_ubumarcos/events.out.tfevents.1725362316.ubumarcos ADDED

@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:0583e8a36e90e098e7c7ce2b2f54d745eb63200aa50d8790a3328390e051822a
+size 6562