MF21377197 commited on
Commit
d6f66d6
1 Parent(s): 5be4776

End of training

Browse files
README.md ADDED
@@ -0,0 +1,68 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: other
3
+ base_model: nvidia/mit-b0
4
+ tags:
5
+ - vision
6
+ - image-segmentation
7
+ - generated_from_trainer
8
+ model-index:
9
+ - name: segformer-b0-finetuned
10
+ results: []
11
+ ---
12
+
13
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
14
+ should probably proofread and complete it, then remove this comment. -->
15
+
16
+ # segformer-b0-finetuned
17
+
18
+ This model is a fine-tuned version of [nvidia/mit-b0](https://huggingface.co/nvidia/mit-b0) on the segments/sidewalk-semantic dataset.
19
+ It achieves the following results on the evaluation set:
20
+ - Loss: 0.5754
21
+ - Mean Iou: 0.2781
22
+ - Mean Accuracy: 0.3329
23
+ - Overall Accuracy: 0.8463
24
+ - Per Category Iou: [0.0, 0.7432161129630078, 0.854265404236928, 0.4606401052721709, 0.6557337899613191, 0.4079867997829282, nan, 0.37471812939221005, 0.2905341043386837, 0.0, 0.7537587486511262, 0.0, 0.0, nan, 0.0, 0.019848656872972055, 0.0, 0.0, 0.7115931639469374, nan, 0.3661808713379434, 0.13378413732653244, 0.0, nan, 0.0, 0.23570903658727577, 0.0, 0.0, 0.8461792428096935, 0.7553019453875489, 0.9045825383881589, 0.0, 0.0, 0.10651182264386322, 0.0]
25
+ - Per Category Accuracy: [0.0, 0.8511274737458464, 0.9523527728262475, 0.7305783824446481, 0.7179823443918317, 0.5112934364530293, nan, 0.4671955914617317, 0.39620749876026823, 0.0, 0.9325380267720194, 0.0, 0.0, nan, 0.0, 0.019920987025907694, 0.0, 0.0, 0.9114075726560573, nan, 0.4767221960460328, 0.14080931640440494, 0.0, nan, 0.0, 0.2902864462270403, 0.0, 0.0, 0.9417630123717813, 0.8946072183599384, 0.9626510283976625, 0.0, 0.0, 0.12104456389804058, 0.0]
26
+
27
+ ## Model description
28
+
29
+ More information needed
30
+
31
+ ## Intended uses & limitations
32
+
33
+ More information needed
34
+
35
+ ## Training and evaluation data
36
+
37
+ More information needed
38
+
39
+ ## Training procedure
40
+
41
+ ### Training hyperparameters
42
+
43
+ The following hyperparameters were used during training:
44
+ - learning_rate: 5e-05
45
+ - train_batch_size: 2
46
+ - eval_batch_size: 2
47
+ - seed: 42
48
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
49
+ - lr_scheduler_type: linear
50
+ - num_epochs: 5
51
+
52
+ ### Training results
53
+
54
+ | Training Loss | Epoch | Step | Validation Loss | Mean Iou | Mean Accuracy | Overall Accuracy | Per Category Iou | Per Category Accuracy |
55
+ |:-------------:|:-----:|:----:|:---------------:|:--------:|:-------------:|:----------------:|:------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------:|:----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------:|
56
+ | 0.6702 | 1.0 | 400 | 0.7027 | 0.2195 | 0.2629 | 0.8084 | [0.0, 0.6792133271879363, 0.7849474176188894, 0.058328120930117175, 0.6300690246185523, 0.25673461142351706, nan, 0.3004378389548008, 0.0, 0.0, 0.6990982425959871, 0.0, 0.0, nan, 0.0, 0.0, 0.0, 0.0, 0.6732565754528083, nan, 0.2872208279956378, 0.00018886917594771138, 0.0, nan, 0.0, 0.04832964515878562, 0.0, 0.0, 0.8103176781510323, 0.6868793807107686, 0.8837386972387465, 0.0, 0.0, 0.005328297121957592, 0.0] | [0.0, 0.766565968404771, 0.9690642801889813, 0.05881921997258986, 0.6774284161746986, 0.2997799346472589, nan, 0.3604784648706302, 0.0, 0.0, 0.9238506174053699, 0.0, 0.0, nan, 0.0, 0.0, 0.0, 0.0, 0.879793151515724, nan, 0.3814629549547224, 0.0001889329509116015, 0.0, nan, 0.0, 0.049252727470549255, 0.0, 0.0, 0.92204060605898, 0.9165746690826654, 0.9399161153753854, 0.0, 0.0, 0.005432287151031732, 0.0] |
57
+ | 0.3787 | 2.0 | 800 | 0.6242 | 0.2529 | 0.3048 | 0.8336 | [0.0, 0.7200155470846057, 0.8500725277201905, 0.4283923004744409, 0.6393695507210657, 0.35502977816991826, nan, 0.35184539253673836, 0.007092160389449536, 0.0, 0.7043921336269122, 0.0, 0.0, nan, 0.0, 0.0, 0.0, 0.0, 0.6907551328541987, nan, 0.30912243068319983, 0.08139332433133045, 0.0, nan, 0.0, 0.188100947571913, 0.0, 0.0, 0.8385996121617959, 0.7447284921504436, 0.8944303097872178, 0.0, 0.0, 0.03573598106370054, 0.0] | [0.0, 0.8883665924421738, 0.9304929976181545, 0.6379432034836245, 0.684408431327974, 0.47546406343261166, nan, 0.46904807570859863, 0.007098812013540027, 0.0, 0.9393400531924911, 0.0, 0.0, nan, 0.0, 0.0, 0.0, 0.0, 0.9030339520321862, nan, 0.38990531390988364, 0.08456558485802895, 0.0, nan, 0.0, 0.22015360876747014, 0.0, 0.0, 0.9441960488646456, 0.8731757915405423, 0.9605664489132197, 0.0, 0.0, 0.0402316629096584, 0.0] |
58
+ | 0.5272 | 3.0 | 1200 | 0.5910 | 0.2640 | 0.3116 | 0.8409 | [0.0, 0.7417372358583919, 0.8491040334276788, 0.5026923409983705, 0.6531274799797995, 0.39671746797276214, nan, 0.3489393985838212, 0.10917691003765771, 0.0, 0.7340253134142348, 0.0, 0.0, nan, 0.0, 0.0015906711475390872, 0.0, 0.0, 0.6888929372303201, nan, 0.3000933215998536, 0.09430463167198322, 0.0, nan, 0.0, 0.22079460109263602, 0.0, 0.0, 0.8375430585634192, 0.7432674869295846, 0.8979452744557967, 0.0, 0.0, 0.06465850622764936, 0.0] | [0.0, 0.8742236634719254, 0.9469015516965127, 0.6845976124235456, 0.7153136135405865, 0.4838119970613863, nan, 0.426013369696913, 0.11595265302602359, 0.0, 0.9307602006332041, 0.0, 0.0, nan, 0.0, 0.0015906711475390872, 0.0, 0.0, 0.9211760279107508, nan, 0.3640489760089845, 0.09822101537391639, 0.0, nan, 0.0, 0.27522724799952525, 0.0, 0.0, 0.9529837430876597, 0.8328291532204077, 0.9626684254773068, 0.0, 0.0, 0.07271163516559737, 0.0] |
59
+ | 1.0028 | 4.0 | 1600 | 0.5819 | 0.2749 | 0.3265 | 0.8451 | [0.0, 0.7442319582171808, 0.8549546101758252, 0.4558465282708946, 0.6592549345415454, 0.40147520263994, nan, 0.3560786579426865, 0.2724675418610539, 0.0, 0.7615078761694535, 0.0, 0.0, nan, 0.0, 0.01480792989181263, 0.0, 0.0, 0.6971675525618446, nan, 0.3289306001269004, 0.1400376526683254, 0.0, nan, 0.0, 0.2330975509072671, 0.0, 0.0, 0.8412274001878343, 0.7610379911113287, 0.9042555512089849, 0.0, 0.0, 0.09514392306437187, 0.0] | [0.0, 0.8831147123698461, 0.9454704608007805, 0.7031575260834061, 0.7194367374187804, 0.4940337653992012, nan, 0.4313144685568867, 0.3654837110023501, 0.0, 0.911745245214873, 0.0, 0.0, nan, 0.0, 0.01483662361250094, 0.0, 0.0, 0.9249937012782616, nan, 0.39416005628239303, 0.1487585698177601, 0.0, nan, 0.0, 0.28903522220353906, 0.0, 0.0, 0.9446123320713813, 0.8885032163816529, 0.9544933330809051, 0.0, 0.0, 0.10758314548292006, 0.0] |
60
+ | 1.3105 | 5.0 | 2000 | 0.5754 | 0.2781 | 0.3329 | 0.8463 | [0.0, 0.7432161129630078, 0.854265404236928, 0.4606401052721709, 0.6557337899613191, 0.4079867997829282, nan, 0.37471812939221005, 0.2905341043386837, 0.0, 0.7537587486511262, 0.0, 0.0, nan, 0.0, 0.019848656872972055, 0.0, 0.0, 0.7115931639469374, nan, 0.3661808713379434, 0.13378413732653244, 0.0, nan, 0.0, 0.23570903658727577, 0.0, 0.0, 0.8461792428096935, 0.7553019453875489, 0.9045825383881589, 0.0, 0.0, 0.10651182264386322, 0.0] | [0.0, 0.8511274737458464, 0.9523527728262475, 0.7305783824446481, 0.7179823443918317, 0.5112934364530293, nan, 0.4671955914617317, 0.39620749876026823, 0.0, 0.9325380267720194, 0.0, 0.0, nan, 0.0, 0.019920987025907694, 0.0, 0.0, 0.9114075726560573, nan, 0.4767221960460328, 0.14080931640440494, 0.0, nan, 0.0, 0.2902864462270403, 0.0, 0.0, 0.9417630123717813, 0.8946072183599384, 0.9626510283976625, 0.0, 0.0, 0.12104456389804058, 0.0] |
61
+
62
+
63
+ ### Framework versions
64
+
65
+ - Transformers 4.38.2
66
+ - Pytorch 2.2.1+cu121
67
+ - Datasets 2.18.0
68
+ - Tokenizers 0.15.2
config.json ADDED
@@ -0,0 +1,144 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "_name_or_path": "nvidia/mit-b0",
3
+ "architectures": [
4
+ "SegformerForSemanticSegmentation"
5
+ ],
6
+ "attention_probs_dropout_prob": 0.0,
7
+ "classifier_dropout_prob": 0.1,
8
+ "decoder_hidden_size": 256,
9
+ "depths": [
10
+ 2,
11
+ 2,
12
+ 2,
13
+ 2
14
+ ],
15
+ "downsampling_rates": [
16
+ 1,
17
+ 4,
18
+ 8,
19
+ 16
20
+ ],
21
+ "drop_path_rate": 0.1,
22
+ "hidden_act": "gelu",
23
+ "hidden_dropout_prob": 0.0,
24
+ "hidden_sizes": [
25
+ 32,
26
+ 64,
27
+ 160,
28
+ 256
29
+ ],
30
+ "id2label": {
31
+ "0": "unlabeled",
32
+ "1": "flat-road",
33
+ "2": "flat-sidewalk",
34
+ "3": "flat-crosswalk",
35
+ "4": "flat-cyclinglane",
36
+ "5": "flat-parkingdriveway",
37
+ "6": "flat-railtrack",
38
+ "7": "flat-curb",
39
+ "8": "human-person",
40
+ "9": "human-rider",
41
+ "10": "vehicle-car",
42
+ "11": "vehicle-truck",
43
+ "12": "vehicle-bus",
44
+ "13": "vehicle-tramtrain",
45
+ "14": "vehicle-motorcycle",
46
+ "15": "vehicle-bicycle",
47
+ "16": "vehicle-caravan",
48
+ "17": "vehicle-cartrailer",
49
+ "18": "construction-building",
50
+ "19": "construction-door",
51
+ "20": "construction-wall",
52
+ "21": "construction-fenceguardrail",
53
+ "22": "construction-bridge",
54
+ "23": "construction-tunnel",
55
+ "24": "construction-stairs",
56
+ "25": "object-pole",
57
+ "26": "object-trafficsign",
58
+ "27": "object-trafficlight",
59
+ "28": "nature-vegetation",
60
+ "29": "nature-terrain",
61
+ "30": "sky",
62
+ "31": "void-ground",
63
+ "32": "void-dynamic",
64
+ "33": "void-static",
65
+ "34": "void-unclear"
66
+ },
67
+ "image_size": 224,
68
+ "initializer_range": 0.02,
69
+ "label2id": {
70
+ "construction-bridge": 22,
71
+ "construction-building": 18,
72
+ "construction-door": 19,
73
+ "construction-fenceguardrail": 21,
74
+ "construction-stairs": 24,
75
+ "construction-tunnel": 23,
76
+ "construction-wall": 20,
77
+ "flat-crosswalk": 3,
78
+ "flat-curb": 7,
79
+ "flat-cyclinglane": 4,
80
+ "flat-parkingdriveway": 5,
81
+ "flat-railtrack": 6,
82
+ "flat-road": 1,
83
+ "flat-sidewalk": 2,
84
+ "human-person": 8,
85
+ "human-rider": 9,
86
+ "nature-terrain": 29,
87
+ "nature-vegetation": 28,
88
+ "object-pole": 25,
89
+ "object-trafficlight": 27,
90
+ "object-trafficsign": 26,
91
+ "sky": 30,
92
+ "unlabeled": 0,
93
+ "vehicle-bicycle": 15,
94
+ "vehicle-bus": 12,
95
+ "vehicle-car": 10,
96
+ "vehicle-caravan": 16,
97
+ "vehicle-cartrailer": 17,
98
+ "vehicle-motorcycle": 14,
99
+ "vehicle-tramtrain": 13,
100
+ "vehicle-truck": 11,
101
+ "void-dynamic": 32,
102
+ "void-ground": 31,
103
+ "void-static": 33,
104
+ "void-unclear": 34
105
+ },
106
+ "layer_norm_eps": 1e-06,
107
+ "mlp_ratios": [
108
+ 4,
109
+ 4,
110
+ 4,
111
+ 4
112
+ ],
113
+ "model_type": "segformer",
114
+ "num_attention_heads": [
115
+ 1,
116
+ 2,
117
+ 5,
118
+ 8
119
+ ],
120
+ "num_channels": 3,
121
+ "num_encoder_blocks": 4,
122
+ "patch_sizes": [
123
+ 7,
124
+ 3,
125
+ 3,
126
+ 3
127
+ ],
128
+ "reshape_last_stage": true,
129
+ "semantic_loss_ignore_index": 255,
130
+ "sr_ratios": [
131
+ 8,
132
+ 4,
133
+ 2,
134
+ 1
135
+ ],
136
+ "strides": [
137
+ 4,
138
+ 2,
139
+ 2,
140
+ 2
141
+ ],
142
+ "torch_dtype": "float32",
143
+ "transformers_version": "4.38.2"
144
+ }
model.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b47a41f2af4b878220cb3b6da94caa7acbe2ef0916608224c941ee6ae51b391f
3
+ size 14918708
runs/Apr10_14-36-28_7bd6b007cc7b/events.out.tfevents.1712759807.7bd6b007cc7b.3348.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:706727356e5dd2b6882a79fb08eeb2f575472e2454f16f9e754553463ad0d532
3
+ size 6870
runs/Apr10_14-36-28_7bd6b007cc7b/events.out.tfevents.1712759834.7bd6b007cc7b.3348.1 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f257f14e4d31b840de5e95f33fb4bfdbdb8635921ccb75a5adb067c2b0a4212d
3
+ size 6870
runs/Apr10_14-36-28_7bd6b007cc7b/events.out.tfevents.1712759869.7bd6b007cc7b.3348.2 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:83cd95be7803d45707e02c47ac1a9f131ae5edffc3a12880ff7f4e6d61dc1173
3
+ size 175602
runs/Apr10_14-54-04_7bd6b007cc7b/events.out.tfevents.1712760855.7bd6b007cc7b.3348.3 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f153fdcc540e3494101fe1310d819f0260cdcdf3939e77015f3029aab66e8929
3
+ size 430916
training_args.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b9ad9a4acdb55bddb2f2428318b07daa202268063ea95bc9b9e4761ea79f8282
3
+ size 4920