Training in progress, epoch 1

Browse files

Files changed (4) hide show

README.md +71 -71
config.json +80 -80
pytorch_model.bin +1 -1
training_args.bin +1 -1

README.md CHANGED Viewed

@@ -1,71 +1,71 @@
----
-license: other
-base_model: nvidia/mit-b3
-tags:
-- generated_from_trainer
-model-index:
-- name: segformer-roof
-  results: []
----
-<!-- This model card has been generated automatically according to the information the Trainer had access to. You
-should probably proofread and complete it, then remove this comment. -->
-# segformer-roof
-This model is a fine-tuned version of [nvidia/mit-b3](https://huggingface.co/nvidia/mit-b3) on the None dataset.
-It achieves the following results on the evaluation set:
-- Loss: 0.1331
-- Mean Iou: 0.6705
-- Mean Accuracy: 0.7459
-- Overall Accuracy: 0.9509
-- Per Category Iou: [0.9501174871140823, 0.44914356298751956, 0.612314004780354]
-- Per Category Accuracy: [0.9835400780911927, 0.5547023488814158, 0.6996065789512536]
-## Model description
-More information needed
-## Intended uses & limitations
-More information needed
-## Training and evaluation data
-More information needed
-## Training procedure
-### Training hyperparameters
-The following hyperparameters were used during training:
-- learning_rate: 6e-05
-- train_batch_size: 8
-- eval_batch_size: 8
-- seed: 42
-- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
-- lr_scheduler_type: linear
-- num_epochs: 10
-### Training results
-| Training Loss | Epoch | Step | Validation Loss | Mean Iou | Mean Accuracy | Overall Accuracy | Per Category Iou                                              | Per Category Accuracy                                         |
-|:-------------:|:-----:|:----:|:---------------:|:--------:|:-------------:|:----------------:|:-------------------------------------------------------------:|:-------------------------------------------------------------:|
-| 0.2357        | 1.0   | 930  | 0.1790          | 0.5725   | 0.6427        | 0.9370           | [0.9372296551214157, 0.265369517396097, 0.5150004888948623]   | [0.9831748407739025, 0.30066901406516805, 0.6442311056295564] |
-| 0.1732        | 2.0   | 1860 | 0.1700          | 0.5944   | 0.6623        | 0.9398           | [0.9401268036898566, 0.32807672100401025, 0.5150202791005837] | [0.9839554901228429, 0.3979733503813561, 0.6048359409441622]  |
-| 0.1559        | 3.0   | 2790 | 0.1600          | 0.6127   | 0.6753        | 0.9429           | [0.9427325757540327, 0.35965960312996503, 0.5357965624353673] | [0.9857619070576414, 0.448355090982288, 0.5917792010373596]   |
-| 0.1482        | 4.0   | 3720 | 0.1550          | 0.6070   | 0.6703        | 0.9437           | [0.944172393520638, 0.32282759103373876, 0.553957021364391]   | [0.9867471837507854, 0.3693595842127839, 0.6546589242888398]  |
-| 0.1388        | 5.0   | 4650 | 0.1459          | 0.6224   | 0.6804        | 0.9463           | [0.9459092954263936, 0.3388316205746287, 0.582315028654454]   | [0.9880828845164424, 0.3845685735297591, 0.6686608983285444]  |
-| 0.1311        | 6.0   | 5580 | 0.1462          | 0.6577   | 0.7466        | 0.9468           | [0.9461722360255241, 0.43412167635821636, 0.5928671376046599] | [0.9789455448939886, 0.5812880374397429, 0.679673538106026]   |
-| 0.1279        | 7.0   | 6510 | 0.1423          | 0.6611   | 0.7569        | 0.9469           | [0.9465891089044499, 0.4381184809600582, 0.5986930952368954]  | [0.9773687408863051, 0.6022321705637107, 0.6909579192751797]  |
-| 0.1232        | 8.0   | 7440 | 0.1388          | 0.6682   | 0.7548        | 0.9491           | [0.9484426711464405, 0.44975791193706466, 0.6064948465370358] | [0.9802070573066378, 0.5898759789294347, 0.6944388397098907]  |
-| 0.1175        | 9.0   | 8370 | 0.1353          | 0.6665   | 0.7392        | 0.9505           | [0.9497153698000098, 0.44248964964215, 0.6074386624389524]    | [0.9841990715683068, 0.5444332218950657, 0.6888275633995394]  |
-| 0.1174        | 10.0  | 9300 | 0.1331          | 0.6705   | 0.7459        | 0.9509           | [0.9501174871140823, 0.44914356298751956, 0.612314004780354]  | [0.9835400780911927, 0.5547023488814158, 0.6996065789512536]  |
-### Framework versions
-- Transformers 4.33.1
-- Pytorch 2.0.1
-- Datasets 2.14.5
-- Tokenizers 0.13.3

+---
+license: other
+base_model: nvidia/mit-b3
+tags:
+- generated_from_trainer
+model-index:
+- name: segformer-roof
+  results: []
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# segformer-roof
+This model is a fine-tuned version of [nvidia/mit-b3](https://huggingface.co/nvidia/mit-b3) on the None dataset.
+It achieves the following results on the evaluation set:
+- Loss: 0.1331
+- Mean Iou: 0.6705
+- Mean Accuracy: 0.7459
+- Overall Accuracy: 0.9509
+- Per Category Iou: [0.9501174871140823, 0.44914356298751956, 0.612314004780354]
+- Per Category Accuracy: [0.9835400780911927, 0.5547023488814158, 0.6996065789512536]
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 6e-05
+- train_batch_size: 8
+- eval_batch_size: 8
+- seed: 42
+- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+- lr_scheduler_type: linear
+- num_epochs: 10
+### Training results
+| Training Loss | Epoch | Step | Validation Loss | Mean Iou | Mean Accuracy | Overall Accuracy | Per Category Iou                                              | Per Category Accuracy                                         |
+|:-------------:|:-----:|:----:|:---------------:|:--------:|:-------------:|:----------------:|:-------------------------------------------------------------:|:-------------------------------------------------------------:|
+| 0.2357        | 1.0   | 930  | 0.1790          | 0.5725   | 0.6427        | 0.9370           | [0.9372296551214157, 0.265369517396097, 0.5150004888948623]   | [0.9831748407739025, 0.30066901406516805, 0.6442311056295564] |
+| 0.1732        | 2.0   | 1860 | 0.1700          | 0.5944   | 0.6623        | 0.9398           | [0.9401268036898566, 0.32807672100401025, 0.5150202791005837] | [0.9839554901228429, 0.3979733503813561, 0.6048359409441622]  |
+| 0.1559        | 3.0   | 2790 | 0.1600          | 0.6127   | 0.6753        | 0.9429           | [0.9427325757540327, 0.35965960312996503, 0.5357965624353673] | [0.9857619070576414, 0.448355090982288, 0.5917792010373596]   |
+| 0.1482        | 4.0   | 3720 | 0.1550          | 0.6070   | 0.6703        | 0.9437           | [0.944172393520638, 0.32282759103373876, 0.553957021364391]   | [0.9867471837507854, 0.3693595842127839, 0.6546589242888398]  |
+| 0.1388        | 5.0   | 4650 | 0.1459          | 0.6224   | 0.6804        | 0.9463           | [0.9459092954263936, 0.3388316205746287, 0.582315028654454]   | [0.9880828845164424, 0.3845685735297591, 0.6686608983285444]  |
+| 0.1311        | 6.0   | 5580 | 0.1462          | 0.6577   | 0.7466        | 0.9468           | [0.9461722360255241, 0.43412167635821636, 0.5928671376046599] | [0.9789455448939886, 0.5812880374397429, 0.679673538106026]   |
+| 0.1279        | 7.0   | 6510 | 0.1423          | 0.6611   | 0.7569        | 0.9469           | [0.9465891089044499, 0.4381184809600582, 0.5986930952368954]  | [0.9773687408863051, 0.6022321705637107, 0.6909579192751797]  |
+| 0.1232        | 8.0   | 7440 | 0.1388          | 0.6682   | 0.7548        | 0.9491           | [0.9484426711464405, 0.44975791193706466, 0.6064948465370358] | [0.9802070573066378, 0.5898759789294347, 0.6944388397098907]  |
+| 0.1175        | 9.0   | 8370 | 0.1353          | 0.6665   | 0.7392        | 0.9505           | [0.9497153698000098, 0.44248964964215, 0.6074386624389524]    | [0.9841990715683068, 0.5444332218950657, 0.6888275633995394]  |
+| 0.1174        | 10.0  | 9300 | 0.1331          | 0.6705   | 0.7459        | 0.9509           | [0.9501174871140823, 0.44914356298751956, 0.612314004780354]  | [0.9835400780911927, 0.5547023488814158, 0.6996065789512536]  |
+### Framework versions
+- Transformers 4.33.1
+- Pytorch 2.0.1
+- Datasets 2.14.5
+- Tokenizers 0.13.3

config.json CHANGED Viewed

@@ -1,80 +1,80 @@
-{
-  "_name_or_path": "nvidia/mit-b3",
-  "architectures": [
-    "SegformerForSemanticSegmentation"
-  ],
-  "attention_probs_dropout_prob": 0.0,
-  "classifier_dropout_prob": 0.1,
-  "decoder_hidden_size": 768,
-  "depths": [
-    3,
-    4,
-    18,
-    3
-  ],
-  "downsampling_rates": [
-    1,
-    4,
-    8,
-    16
-  ],
-  "drop_path_rate": 0.1,
-  "hidden_act": "gelu",
-  "hidden_dropout_prob": 0.0,
-  "hidden_sizes": [
-    64,
-    128,
-    320,
-    512
-  ],
-  "id2label": {
-    "0": "\u0424\u043e\u043d",
-    "1": "\u0412\u0437\u0434\u0443\u0442\u0438\u0435",
-    "2": "\u0412\u043f\u0430\u0434\u0438\u043d\u0430"
-  },
-  "image_size": 224,
-  "initializer_range": 0.02,
-  "label2id": {
-    "\u0412\u0437\u0434\u0443\u0442\u0438\u0435": 1,
-    "\u0412\u043f\u0430\u0434\u0438\u043d\u0430": 2,
-    "\u0424\u043e\u043d": 0
-  },
-  "layer_norm_eps": 1e-06,
-  "mlp_ratios": [
-    4,
-    4,
-    4,
-    4
-  ],
-  "model_type": "segformer",
-  "num_attention_heads": [
-    1,
-    2,
-    5,
-    8
-  ],
-  "num_channels": 3,
-  "num_encoder_blocks": 4,
-  "patch_sizes": [
-    7,
-    3,
-    3,
-    3
-  ],
-  "reshape_last_stage": true,
-  "semantic_loss_ignore_index": 255,
-  "sr_ratios": [
-    8,
-    4,
-    2,
-    1
-  ],
-  "strides": [
-    4,
-    2,
-    2,
-    2
-  ],
-  "torch_dtype": "float32",
-  "transformers_version": "4.33.1"
-}

+{
+  "_name_or_path": "nvidia/mit-b3",
+  "architectures": [
+    "SegformerForSemanticSegmentation"
+  ],
+  "attention_probs_dropout_prob": 0.0,
+  "classifier_dropout_prob": 0.1,
+  "decoder_hidden_size": 768,
+  "depths": [
+    3,
+    4,
+    18,
+    3
+  ],
+  "downsampling_rates": [
+    1,
+    4,
+    8,
+    16
+  ],
+  "drop_path_rate": 0.1,
+  "hidden_act": "gelu",
+  "hidden_dropout_prob": 0.0,
+  "hidden_sizes": [
+    64,
+    128,
+    320,
+    512
+  ],
+  "id2label": {
+    "0": "\u0424\u043e\u043d",
+    "1": "\u0412\u0437\u0434\u0443\u0442\u0438\u0435",
+    "2": "\u0412\u043f\u0430\u0434\u0438\u043d\u0430"
+  },
+  "image_size": 224,
+  "initializer_range": 0.02,
+  "label2id": {
+    "\u0412\u0437\u0434\u0443\u0442\u0438\u0435": 1,
+    "\u0412\u043f\u0430\u0434\u0438\u043d\u0430": 2,
+    "\u0424\u043e\u043d": 0
+  },
+  "layer_norm_eps": 1e-06,
+  "mlp_ratios": [
+    4,
+    4,
+    4,
+    4
+  ],
+  "model_type": "segformer",
+  "num_attention_heads": [
+    1,
+    2,
+    5,
+    8
+  ],
+  "num_channels": 3,
+  "num_encoder_blocks": 4,
+  "patch_sizes": [
+    7,
+    3,
+    3,
+    3
+  ],
+  "reshape_last_stage": true,
+  "semantic_loss_ignore_index": 255,
+  "sr_ratios": [
+    8,
+    4,
+    2,
+    1
+  ],
+  "strides": [
+    4,
+    2,
+    2,
+    2
+  ],
+  "torch_dtype": "float32",
+  "transformers_version": "4.33.1"
+}

pytorch_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:ca08b4feadd0d95335f2d07acb4cc8b389bbca0544d68dddd8b3c7ee6d09a6ab
 size 189128349

 version https://git-lfs.github.com/spec/v1
+oid sha256:5ee1dd3761971f29f1979225d069ecb03578e88305f90a9badbbce8755db4522
 size 189128349

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:9b0fc43edae985d7abd087c646b3e9bd997213664ba18305b2d21f8c35c92fa7
 size 3963

 version https://git-lfs.github.com/spec/v1
+oid sha256:fe746e79814b9e4085e1dedadf74f6a977385b495ed803e33fd9fe586614c064
 size 3963