End of training

Browse files

Files changed (13) hide show

README.md +117 -0
all_results.json +22 -0
config.json +42 -0
eval_results.json +17 -0
model.safetensors +3 -0
preprocessor_config.json +23 -0
runs/Jan31_16-04-05_modal/events.out.tfevents.1738339446.modal.2.0 +3 -0
runs/Jan31_16-04-05_modal/events.out.tfevents.1738339446.modal.2.1 +3 -0
runs/Jan31_16-04-05_modal/events.out.tfevents.1738340724.modal.2.2 +3 -0
runs/Jan31_16-04-05_modal/events.out.tfevents.1738340724.modal.2.3 +3 -0
train_results.json +8 -0
trainer_state.json +0 -0
training_args.bin +3 -0

README.md ADDED Viewed

	@@ -0,0 +1,117 @@

+---
+library_name: transformers
+license: apache-2.0
+base_model: google/vit-base-patch16-224-in21k
+tags:
+- generated_from_trainer
+metrics:
+- accuracy
+model-index:
+- name: squarerun
+  results: []
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# squarerun
+This model is a fine-tuned version of [google/vit-base-patch16-224-in21k](https://huggingface.co/google/vit-base-patch16-224-in21k) on an unknown dataset.
+It achieves the following results on the evaluation set:
+- Loss: 1.3394
+- F1 Macro: 0.4627
+- F1 Micro: 0.5606
+- F1 Weighted: 0.5294
+- Precision Macro: 0.4704
+- Precision Micro: 0.5606
+- Precision Weighted: 0.5310
+- Recall Macro: 0.4855
+- Recall Micro: 0.5606
+- Recall Weighted: 0.5606
+- Accuracy: 0.5606
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 0.0001
+- train_batch_size: 8
+- eval_batch_size: 8
+- seed: 42
+- gradient_accumulation_steps: 2
+- total_train_batch_size: 16
+- optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
+- lr_scheduler_type: linear
+- lr_scheduler_warmup_ratio: 0.1
+- num_epochs: 45
+### Training results
+| Training Loss | Epoch | Step | Validation Loss | F1 Macro | F1 Micro | F1 Weighted | Precision Macro | Precision Micro | Precision Weighted | Recall Macro | Recall Micro | Recall Weighted | Accuracy |
+|:-------------:|:-----:|:----:|:---------------:|:--------:|:--------:|:-----------:|:---------------:|:---------------:|:------------------:|:------------:|:------------:|:---------------:|:--------:|
+| 1.903         | 1.0   | 29   | 1.8868          | 0.0658   | 0.1742   | 0.0900      | 0.0502          | 0.1742          | 0.0693             | 0.1293       | 0.1742       | 0.1742          | 0.1742   |
+| 1.8662        | 2.0   | 58   | 1.8740          | 0.0754   | 0.2197   | 0.1004      | 0.0603          | 0.2197          | 0.0773             | 0.1580       | 0.2197       | 0.2197          | 0.2197   |
+| 1.9291        | 3.0   | 87   | 1.8862          | 0.0485   | 0.2045   | 0.0695      | 0.0292          | 0.2045          | 0.0418             | 0.1429       | 0.2045       | 0.2045          | 0.2045   |
+| 1.7838        | 4.0   | 116  | 1.8127          | 0.1171   | 0.2652   | 0.1474      | 0.1092          | 0.2652          | 0.1321             | 0.1973       | 0.2652       | 0.2652          | 0.2652   |
+| 1.7113        | 5.0   | 145  | 1.6979          | 0.2133   | 0.3485   | 0.2592      | 0.3189          | 0.3485          | 0.3631             | 0.2822       | 0.3485       | 0.3485          | 0.3485   |
+| 1.6459        | 6.0   | 174  | 1.5577          | 0.2714   | 0.3939   | 0.3225      | 0.4296          | 0.3939          | 0.4531             | 0.3198       | 0.3939       | 0.3939          | 0.3939   |
+| 1.4829        | 7.0   | 203  | 1.3814          | 0.4069   | 0.5227   | 0.4611      | 0.3786          | 0.5227          | 0.4216             | 0.4511       | 0.5227       | 0.5227          | 0.5227   |
+| 1.2847        | 8.0   | 232  | 1.3783          | 0.3675   | 0.4545   | 0.4176      | 0.4992          | 0.4545          | 0.5702             | 0.4080       | 0.4545       | 0.4545          | 0.4545   |
+| 0.7746        | 9.0   | 261  | 1.1536          | 0.4579   | 0.5758   | 0.5298      | 0.5301          | 0.5758          | 0.5896             | 0.4853       | 0.5758       | 0.5758          | 0.5758   |
+| 1.0172        | 10.0  | 290  | 1.2211          | 0.4700   | 0.5909   | 0.5365      | 0.5722          | 0.5909          | 0.6399             | 0.5182       | 0.5909       | 0.5909          | 0.5909   |
+| 0.7865        | 11.0  | 319  | 1.1357          | 0.5282   | 0.6136   | 0.5961      | 0.5342          | 0.6136          | 0.6009             | 0.5432       | 0.6136       | 0.6136          | 0.6136   |
+| 0.8335        | 12.0  | 348  | 1.1530          | 0.5315   | 0.6061   | 0.6017      | 0.5365          | 0.6061          | 0.6209             | 0.5489       | 0.6061       | 0.6061          | 0.6061   |
+| 0.6959        | 13.0  | 377  | 1.1307          | 0.5638   | 0.6667   | 0.6451      | 0.5912          | 0.6667          | 0.6615             | 0.5773       | 0.6667       | 0.6667          | 0.6667   |
+| 0.5864        | 14.0  | 406  | 1.1957          | 0.5211   | 0.5985   | 0.5894      | 0.5537          | 0.5985          | 0.6275             | 0.5389       | 0.5985       | 0.5985          | 0.5985   |
+| 0.6145        | 15.0  | 435  | 0.9957          | 0.6086   | 0.7045   | 0.6833      | 0.6164          | 0.7045          | 0.6791             | 0.6160       | 0.7045       | 0.7045          | 0.7045   |
+| 0.5632        | 16.0  | 464  | 1.2302          | 0.5112   | 0.5985   | 0.5781      | 0.5219          | 0.5985          | 0.5853             | 0.5236       | 0.5985       | 0.5985          | 0.5985   |
+| 0.3392        | 17.0  | 493  | 1.1925          | 0.5335   | 0.6288   | 0.6043      | 0.5903          | 0.6288          | 0.6435             | 0.5355       | 0.6288       | 0.6288          | 0.6288   |
+| 0.2998        | 18.0  | 522  | 1.1444          | 0.5544   | 0.6364   | 0.6251      | 0.5520          | 0.6364          | 0.6248             | 0.5670       | 0.6364       | 0.6364          | 0.6364   |
+| 0.2706        | 19.0  | 551  | 1.1072          | 0.5579   | 0.6439   | 0.6308      | 0.5790          | 0.6439          | 0.6404             | 0.5571       | 0.6439       | 0.6439          | 0.6439   |
+| 0.2012        | 20.0  | 580  | 1.1353          | 0.5278   | 0.6212   | 0.6012      | 0.5433          | 0.6212          | 0.6063             | 0.5346       | 0.6212       | 0.6212          | 0.6212   |
+| 0.532         | 21.0  | 609  | 1.2503          | 0.5421   | 0.6212   | 0.6079      | 0.5651          | 0.6212          | 0.6253             | 0.5488       | 0.6212       | 0.6212          | 0.6212   |
+| 0.0963        | 22.0  | 638  | 1.2203          | 0.5702   | 0.6288   | 0.6227      | 0.5807          | 0.6288          | 0.6327             | 0.5745       | 0.6288       | 0.6288          | 0.6288   |
+| 0.1076        | 23.0  | 667  | 1.3798          | 0.5216   | 0.6136   | 0.5894      | 0.5339          | 0.6136          | 0.5971             | 0.5370       | 0.6136       | 0.6136          | 0.6136   |
+| 0.1773        | 24.0  | 696  | 1.3129          | 0.5422   | 0.6288   | 0.6169      | 0.5581          | 0.6288          | 0.6253             | 0.5453       | 0.6288       | 0.6288          | 0.6288   |
+| 0.0598        | 25.0  | 725  | 1.2855          | 0.5633   | 0.6515   | 0.6381      | 0.5846          | 0.6515          | 0.6562             | 0.5713       | 0.6515       | 0.6515          | 0.6515   |
+| 0.0632        | 26.0  | 754  | 1.3155          | 0.6414   | 0.6591   | 0.6643      | 0.6525          | 0.6591          | 0.6925             | 0.6585       | 0.6591       | 0.6591          | 0.6591   |
+| 0.0644        | 27.0  | 783  | 1.3211          | 0.5588   | 0.6439   | 0.6315      | 0.5745          | 0.6439          | 0.6357             | 0.5595       | 0.6439       | 0.6439          | 0.6439   |
+| 0.1495        | 28.0  | 812  | 1.4196          | 0.5539   | 0.6364   | 0.6245      | 0.5650          | 0.6364          | 0.6270             | 0.5556       | 0.6364       | 0.6364          | 0.6364   |
+| 0.0413        | 29.0  | 841  | 1.4027          | 0.5378   | 0.6136   | 0.6102      | 0.5405          | 0.6136          | 0.6100             | 0.5380       | 0.6136       | 0.6136          | 0.6136   |
+| 0.0323        | 30.0  | 870  | 1.4302          | 0.5641   | 0.6364   | 0.6329      | 0.5689          | 0.6364          | 0.6430             | 0.5712       | 0.6364       | 0.6364          | 0.6364   |
+| 0.0452        | 31.0  | 899  | 1.4577          | 0.5706   | 0.6515   | 0.6412      | 0.5835          | 0.6515          | 0.6478             | 0.5738       | 0.6515       | 0.6515          | 0.6515   |
+| 0.0285        | 32.0  | 928  | 1.4224          | 0.5597   | 0.6439   | 0.6300      | 0.5618          | 0.6439          | 0.6250             | 0.5657       | 0.6439       | 0.6439          | 0.6439   |
+| 0.0241        | 33.0  | 957  | 1.4513          | 0.5542   | 0.6364   | 0.6252      | 0.5700          | 0.6364          | 0.6309             | 0.5533       | 0.6364       | 0.6364          | 0.6364   |
+| 0.0224        | 34.0  | 986  | 1.4701          | 0.5795   | 0.6742   | 0.6545      | 0.5856          | 0.6742          | 0.6523             | 0.5902       | 0.6742       | 0.6742          | 0.6742   |
+| 0.0228        | 35.0  | 1015 | 1.4697          | 0.5772   | 0.6591   | 0.6489      | 0.5870          | 0.6591          | 0.6497             | 0.5774       | 0.6591       | 0.6591          | 0.6591   |
+| 0.0231        | 36.0  | 1044 | 1.5315          | 0.5745   | 0.6591   | 0.6491      | 0.5783          | 0.6591          | 0.6483             | 0.5788       | 0.6591       | 0.6591          | 0.6591   |
+| 0.0457        | 37.0  | 1073 | 1.5210          | 0.5532   | 0.6439   | 0.6277      | 0.5641          | 0.6439          | 0.6317             | 0.5606       | 0.6439       | 0.6439          | 0.6439   |
+| 0.0197        | 38.0  | 1102 | 1.4956          | 0.5636   | 0.6515   | 0.6386      | 0.5590          | 0.6515          | 0.6296             | 0.5714       | 0.6515       | 0.6515          | 0.6515   |
+| 0.0219        | 39.0  | 1131 | 1.4910          | 0.5981   | 0.6591   | 0.6540      | 0.6063          | 0.6591          | 0.6554             | 0.5970       | 0.6591       | 0.6591          | 0.6591   |
+| 0.0212        | 40.0  | 1160 | 1.5050          | 0.5912   | 0.6515   | 0.6462      | 0.5997          | 0.6515          | 0.6472             | 0.5898       | 0.6515       | 0.6515          | 0.6515   |
+| 0.0212        | 41.0  | 1189 | 1.5091          | 0.5977   | 0.6591   | 0.6537      | 0.6080          | 0.6591          | 0.6558             | 0.5955       | 0.6591       | 0.6591          | 0.6591   |
+| 0.0202        | 42.0  | 1218 | 1.4961          | 0.5655   | 0.6515   | 0.6411      | 0.5708          | 0.6515          | 0.6411             | 0.5695       | 0.6515       | 0.6515          | 0.6515   |
+| 0.0216        | 43.0  | 1247 | 1.4917          | 0.5655   | 0.6515   | 0.6411      | 0.5708          | 0.6515          | 0.6411             | 0.5695       | 0.6515       | 0.6515          | 0.6515   |
+| 0.0199        | 44.0  | 1276 | 1.4855          | 0.5674   | 0.6515   | 0.6423      | 0.5694          | 0.6515          | 0.6401             | 0.5717       | 0.6515       | 0.6515          | 0.6515   |
+| 0.027         | 45.0  | 1305 | 1.4832          | 0.5674   | 0.6515   | 0.6423      | 0.5694          | 0.6515          | 0.6401             | 0.5717       | 0.6515       | 0.6515          | 0.6515   |
+### Framework versions
+- Transformers 4.48.1
+- Pytorch 2.5.1+cu124
+- Datasets 3.2.0
+- Tokenizers 0.21.0

all_results.json ADDED Viewed

	@@ -0,0 +1,22 @@

+{
+    "epoch": 45.0,
+    "eval_accuracy": 0.5606060606060606,
+    "eval_f1_macro": 0.46274430367680575,
+    "eval_f1_micro": 0.5606060606060606,
+    "eval_f1_weighted": 0.5293883851758615,
+    "eval_loss": 1.3394147157669067,
+    "eval_precision_macro": 0.4704431772709084,
+    "eval_precision_micro": 0.5606060606060606,
+    "eval_precision_weighted": 0.5309502588914353,
+    "eval_recall_macro": 0.48548752834467124,
+    "eval_recall_micro": 0.5606060606060606,
+    "eval_recall_weighted": 0.5606060606060606,
+    "eval_runtime": 1.2068,
+    "eval_samples_per_second": 54.688,
+    "eval_steps_per_second": 7.457,
+    "total_flos": 1.611130661124526e+18,
+    "train_loss": 0.5009927850430724,
+    "train_runtime": 1277.117,
+    "train_samples_per_second": 16.279,
+    "train_steps_per_second": 1.022
+}

config.json ADDED Viewed

	@@ -0,0 +1,42 @@

+{
+  "_name_or_path": "google/vit-base-patch16-224-in21k",
+  "architectures": [
+    "ViTForImageClassification"
+  ],
+  "attention_probs_dropout_prob": 0.0,
+  "encoder_stride": 16,
+  "hidden_act": "gelu",
+  "hidden_dropout_prob": 0.0,
+  "hidden_size": 768,
+  "id2label": {
+    "0": "-",
+    "1": "0",
+    "2": "1",
+    "3": "2",
+    "4": "3",
+    "5": "4",
+    "6": "5"
+  },
+  "image_size": 224,
+  "initializer_range": 0.02,
+  "intermediate_size": 3072,
+  "label2id": {
+    "-": "0",
+    "0": "1",
+    "1": "2",
+    "2": "3",
+    "3": "4",
+    "4": "5",
+    "5": "6"
+  },
+  "layer_norm_eps": 1e-12,
+  "model_type": "vit",
+  "num_attention_heads": 12,
+  "num_channels": 3,
+  "num_hidden_layers": 12,
+  "patch_size": 16,
+  "problem_type": "single_label_classification",
+  "qkv_bias": true,
+  "torch_dtype": "float32",
+  "transformers_version": "4.48.1"
+}

eval_results.json ADDED Viewed

	@@ -0,0 +1,17 @@

+{
+    "epoch": 45.0,
+    "eval_accuracy": 0.5606060606060606,
+    "eval_f1_macro": 0.46274430367680575,
+    "eval_f1_micro": 0.5606060606060606,
+    "eval_f1_weighted": 0.5293883851758615,
+    "eval_loss": 1.3394147157669067,
+    "eval_precision_macro": 0.4704431772709084,
+    "eval_precision_micro": 0.5606060606060606,
+    "eval_precision_weighted": 0.5309502588914353,
+    "eval_recall_macro": 0.48548752834467124,
+    "eval_recall_micro": 0.5606060606060606,
+    "eval_recall_weighted": 0.5606060606060606,
+    "eval_runtime": 1.2068,
+    "eval_samples_per_second": 54.688,
+    "eval_steps_per_second": 7.457
+}

model.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:40764115dc128d01def5740bc40aff3eb210950a1d37eea6aeb2a0a968bd7917
+size 343239356

preprocessor_config.json ADDED Viewed

	@@ -0,0 +1,23 @@

+{
+  "do_convert_rgb": null,
+  "do_normalize": true,
+  "do_rescale": true,
+  "do_resize": true,
+  "image_mean": [
+    0.5,
+    0.5,
+    0.5
+  ],
+  "image_processor_type": "ViTImageProcessorFast",
+  "image_std": [
+    0.5,
+    0.5,
+    0.5
+  ],
+  "resample": 2,
+  "rescale_factor": 0.00392156862745098,
+  "size": {
+    "height": 224,
+    "width": 224
+  }
+}

runs/Jan31_16-04-05_modal/events.out.tfevents.1738339446.modal.2.0 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:804589adf6ba795df1d4d59cbc34bd770121bc7a1c5a932f7dabae3253e65bbe
+size 180223

runs/Jan31_16-04-05_modal/events.out.tfevents.1738339446.modal.2.1 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:5ec0a2263b7a2a2881df3189667f467b47b758ba95f56e823636730a0185ec0b
+size 180223

runs/Jan31_16-04-05_modal/events.out.tfevents.1738340724.modal.2.2 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:e9ad35e03be9d652596bceca36c9bbebcf19e6bb3fdb71d1748dd72b415ed419
+size 921

runs/Jan31_16-04-05_modal/events.out.tfevents.1738340724.modal.2.3 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:9ba1dd627c1052c88c1c9dca879730f4c003c188ca5c7794f06b677dc526ac55
+size 921

train_results.json ADDED Viewed

	@@ -0,0 +1,8 @@

+{
+    "epoch": 45.0,
+    "total_flos": 1.611130661124526e+18,
+    "train_loss": 0.5009927850430724,
+    "train_runtime": 1277.117,
+    "train_samples_per_second": 16.279,
+    "train_steps_per_second": 1.022
+}

trainer_state.json ADDED Viewed

The diff for this file is too large to render. See raw diff

training_args.bin ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:0f3cd6bc1b90e400a15cc6f7d632ce82989dc36b5c59049a6b438eea0c91011e
+size 5368