End of training

Browse files

Files changed (6) hide show

README.md +127 -0
config.json +40 -0
model.safetensors +3 -0
preprocessor_config.json +22 -0
runs/Apr04_16-43-59_9661203133a4/events.out.tfevents.1712249040.9661203133a4.161.0 +3 -0
training_args.bin +3 -0

README.md ADDED Viewed

	@@ -0,0 +1,127 @@

+---
+license: apache-2.0
+base_model: google/vit-base-patch16-224-in21k
+tags:
+- generated_from_trainer
+datasets:
+- imagefolder
+metrics:
+- accuracy
+model-index:
+- name: Chess_Images
+  results:
+  - task:
+      name: Image Classification
+      type: image-classification
+    dataset:
+      name: imagefolder
+      type: imagefolder
+      config: default
+      split: train
+      args: default
+    metrics:
+    - name: Accuracy
+      type: accuracy
+      value: 0.9666666666666667
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# Chess_Images
+This model is a fine-tuned version of [google/vit-base-patch16-224-in21k](https://huggingface.co/google/vit-base-patch16-224-in21k) on the imagefolder dataset.
+It achieves the following results on the evaluation set:
+- Loss: 0.3882
+- Accuracy: 0.9667
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 5e-05
+- train_batch_size: 16
+- eval_batch_size: 16
+- seed: 42
+- gradient_accumulation_steps: 4
+- total_train_batch_size: 64
+- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+- lr_scheduler_type: linear
+- lr_scheduler_warmup_ratio: 0.1
+- num_epochs: 50
+### Training results
+| Training Loss | Epoch | Step | Validation Loss | Accuracy |
+|:-------------:|:-----:|:----:|:---------------:|:--------:|
+| No log        | 1.0   | 2    | 1.7939          | 0.2      |
+| No log        | 2.0   | 4    | 1.7836          | 0.1333   |
+| No log        | 3.0   | 6    | 1.7646          | 0.0667   |
+| No log        | 4.0   | 8    | 1.6917          | 0.1333   |
+| 1.7382        | 5.0   | 10   | 1.6700          | 0.3333   |
+| 1.7382        | 6.0   | 12   | 1.5990          | 0.5      |
+| 1.7382        | 7.0   | 14   | 1.5424          | 0.4667   |
+| 1.7382        | 8.0   | 16   | 1.4673          | 0.6      |
+| 1.7382        | 9.0   | 18   | 1.4155          | 0.7333   |
+| 1.3754        | 10.0  | 20   | 1.3015          | 0.7      |
+| 1.3754        | 11.0  | 22   | 1.3055          | 0.6667   |
+| 1.3754        | 12.0  | 24   | 1.2209          | 0.7      |
+| 1.3754        | 13.0  | 26   | 1.0965          | 0.8      |
+| 1.3754        | 14.0  | 28   | 1.0976          | 0.7667   |
+| 0.9947        | 15.0  | 30   | 1.0388          | 0.8667   |
+| 0.9947        | 16.0  | 32   | 1.0757          | 0.7      |
+| 0.9947        | 17.0  | 34   | 0.9617          | 0.8      |
+| 0.9947        | 18.0  | 36   | 0.8713          | 0.8667   |
+| 0.9947        | 19.0  | 38   | 0.8803          | 0.8667   |
+| 0.7399        | 20.0  | 40   | 0.8257          | 0.8667   |
+| 0.7399        | 21.0  | 42   | 0.8740          | 0.8333   |
+| 0.7399        | 22.0  | 44   | 0.7554          | 0.9667   |
+| 0.7399        | 23.0  | 46   | 0.7581          | 0.8667   |
+| 0.7399        | 24.0  | 48   | 0.7983          | 0.8333   |
+| 0.5797        | 25.0  | 50   | 0.7052          | 0.9333   |
+| 0.5797        | 26.0  | 52   | 0.7930          | 0.8667   |
+| 0.5797        | 27.0  | 54   | 0.7511          | 0.8333   |
+| 0.5797        | 28.0  | 56   | 0.5578          | 0.9667   |
+| 0.5797        | 29.0  | 58   | 0.5771          | 0.9667   |
+| 0.4642        | 30.0  | 60   | 0.5641          | 0.9667   |
+| 0.4642        | 31.0  | 62   | 0.5368          | 0.9667   |
+| 0.4642        | 32.0  | 64   | 0.5313          | 0.9333   |
+| 0.4642        | 33.0  | 66   | 0.5521          | 0.9      |
+| 0.4642        | 34.0  | 68   | 0.5530          | 0.9333   |
+| 0.3813        | 35.0  | 70   | 0.5416          | 0.9      |
+| 0.3813        | 36.0  | 72   | 0.4796          | 0.9      |
+| 0.3813        | 37.0  | 74   | 0.4627          | 0.9667   |
+| 0.3813        | 38.0  | 76   | 0.4788          | 0.9667   |
+| 0.3813        | 39.0  | 78   | 0.5044          | 0.9      |
+| 0.3555        | 40.0  | 80   | 0.5886          | 0.8667   |
+| 0.3555        | 41.0  | 82   | 0.4892          | 0.9      |
+| 0.3555        | 42.0  | 84   | 0.5306          | 0.8333   |
+| 0.3555        | 43.0  | 86   | 0.5294          | 0.8333   |
+| 0.3555        | 44.0  | 88   | 0.5260          | 0.8667   |
+| 0.3441        | 45.0  | 90   | 0.4445          | 0.9667   |
+| 0.3441        | 46.0  | 92   | 0.4579          | 0.9      |
+| 0.3441        | 47.0  | 94   | 0.4390          | 0.9333   |
+| 0.3441        | 48.0  | 96   | 0.4139          | 0.9667   |
+| 0.3441        | 49.0  | 98   | 0.4820          | 0.9667   |
+| 0.3155        | 50.0  | 100  | 0.3882          | 0.9667   |
+### Framework versions
+- Transformers 4.38.2
+- Pytorch 2.2.1+cu121
+- Datasets 2.18.0
+- Tokenizers 0.15.2

config.json ADDED Viewed

	@@ -0,0 +1,40 @@

+{
+  "_name_or_path": "google/vit-base-patch16-224-in21k",
+  "architectures": [
+    "ViTForImageClassification"
+  ],
+  "attention_probs_dropout_prob": 0.0,
+  "encoder_stride": 16,
+  "hidden_act": "gelu",
+  "hidden_dropout_prob": 0.0,
+  "hidden_size": 768,
+  "id2label": {
+    "0": "Black king",
+    "1": "Black knight",
+    "2": "Black queen",
+    "3": "White king",
+    "4": "White knight",
+    "5": "White queen"
+  },
+  "image_size": 224,
+  "initializer_range": 0.02,
+  "intermediate_size": 3072,
+  "label2id": {
+    "Black king": "0",
+    "Black knight": "1",
+    "Black queen": "2",
+    "White king": "3",
+    "White knight": "4",
+    "White queen": "5"
+  },
+  "layer_norm_eps": 1e-12,
+  "model_type": "vit",
+  "num_attention_heads": 12,
+  "num_channels": 3,
+  "num_hidden_layers": 12,
+  "patch_size": 16,
+  "problem_type": "single_label_classification",
+  "qkv_bias": true,
+  "torch_dtype": "float32",
+  "transformers_version": "4.38.2"
+}

model.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:5707881086dd888b634f50070c2fcceae507926135b7b812e02d476ba8b9924c
+size 343236280

preprocessor_config.json ADDED Viewed

	@@ -0,0 +1,22 @@

+{
+  "do_normalize": true,
+  "do_rescale": true,
+  "do_resize": true,
+  "image_mean": [
+    0.5,
+    0.5,
+    0.5
+  ],
+  "image_processor_type": "ViTImageProcessor",
+  "image_std": [
+    0.5,
+    0.5,
+    0.5
+  ],
+  "resample": 2,
+  "rescale_factor": 0.00392156862745098,
+  "size": {
+    "height": 224,
+    "width": 224
+  }
+}

runs/Apr04_16-43-59_9661203133a4/events.out.tfevents.1712249040.9661203133a4.161.0 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:69f6cac504aafb64c61a3d2427a134964da119726bd3c2f2c416cc5bf04ab181
+size 23030

training_args.bin ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:e948f170379585625c6489fddfde5049016a7f7c8726763701243de816466c8d
+size 4920