File size: 13,858 Bytes
959537a
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
---
license: apache-2.0
base_model: google/vit-base-patch16-224-in21k
tags:
- generated_from_trainer
metrics:
- accuracy
model-index:
- name: vit-base-patch16-224-in21k-finetune-os300_norm
  results: []
---

<!-- This model card has been generated automatically according to the information the Trainer had access to. You
should probably proofread and complete it, then remove this comment. -->

# vit-base-patch16-224-in21k-finetune-os300_norm

This model is a fine-tuned version of [google/vit-base-patch16-224-in21k](https://huggingface.co/google/vit-base-patch16-224-in21k) on an unknown dataset.
It achieves the following results on the evaluation set:
- Loss: 0.3499
- Accuracy: 0.8577

## Model description

More information needed

## Intended uses & limitations

More information needed

## Training and evaluation data

More information needed

## Training procedure

### Training hyperparameters

The following hyperparameters were used during training:
- learning_rate: 0.005
- train_batch_size: 128
- eval_batch_size: 128
- seed: 42
- gradient_accumulation_steps: 4
- total_train_batch_size: 512
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: linear
- num_epochs: 200
- mixed_precision_training: Native AMP

### Training results

| Training Loss | Epoch  | Step | Validation Loss | Accuracy |
|:-------------:|:------:|:----:|:---------------:|:--------:|
| 1.038         | 0.98   | 11   | 0.7215          | 0.6568   |
| 0.7212        | 1.96   | 22   | 0.7280          | 0.6568   |
| 0.7201        | 2.93   | 33   | 0.7285          | 0.6568   |
| 0.7308        | 4.0    | 45   | 0.7297          | 0.6568   |
| 0.7341        | 4.98   | 56   | 0.7277          | 0.6568   |
| 0.7255        | 5.96   | 67   | 0.7350          | 0.6568   |
| 0.7274        | 6.93   | 78   | 0.7258          | 0.6568   |
| 0.7189        | 8.0    | 90   | 0.7205          | 0.6568   |
| 0.7194        | 8.98   | 101  | 0.7117          | 0.6568   |
| 0.7437        | 9.96   | 112  | 0.7340          | 0.6568   |
| 0.7578        | 10.93  | 123  | 0.7317          | 0.6568   |
| 0.7307        | 12.0   | 135  | 0.7288          | 0.6568   |
| 0.7279        | 12.98  | 146  | 0.7246          | 0.6568   |
| 0.727         | 13.96  | 157  | 0.7166          | 0.6568   |
| 0.7161        | 14.93  | 168  | 0.7306          | 0.5117   |
| 0.6775        | 16.0   | 180  | 0.6360          | 0.6568   |
| 0.6487        | 16.98  | 191  | 0.6166          | 0.7113   |
| 0.607         | 17.96  | 202  | 0.5871          | 0.7240   |
| 0.5961        | 18.93  | 213  | 0.5606          | 0.7183   |
| 0.5681        | 20.0   | 225  | 0.5459          | 0.7381   |
| 0.5756        | 20.98  | 236  | 0.5375          | 0.7481   |
| 0.5666        | 21.96  | 247  | 0.5720          | 0.7042   |
| 0.5658        | 22.93  | 258  | 0.5127          | 0.7481   |
| 0.5461        | 24.0   | 270  | 0.5254          | 0.7360   |
| 0.5484        | 24.98  | 281  | 0.5124          | 0.7431   |
| 0.5442        | 25.96  | 292  | 0.5665          | 0.7282   |
| 0.5573        | 26.93  | 303  | 0.5019          | 0.7594   |
| 0.535         | 28.0   | 315  | 0.5112          | 0.7792   |
| 0.5319        | 28.98  | 326  | 0.4729          | 0.7856   |
| 0.4953        | 29.96  | 337  | 0.6292          | 0.7318   |
| 0.5408        | 30.93  | 348  | 0.5083          | 0.7877   |
| 0.5215        | 32.0   | 360  | 0.5131          | 0.7799   |
| 0.5291        | 32.98  | 371  | 0.4867          | 0.7983   |
| 0.4971        | 33.96  | 382  | 0.4742          | 0.7962   |
| 0.5004        | 34.93  | 393  | 0.4930          | 0.7806   |
| 0.4868        | 36.0   | 405  | 0.4550          | 0.8061   |
| 0.4784        | 36.98  | 416  | 0.4667          | 0.7912   |
| 0.469         | 37.96  | 427  | 0.4915          | 0.7856   |
| 0.455         | 38.93  | 438  | 0.5016          | 0.7537   |
| 0.4903        | 40.0   | 450  | 0.4874          | 0.7877   |
| 0.4904        | 40.98  | 461  | 0.5222          | 0.7629   |
| 0.513         | 41.96  | 472  | 0.4772          | 0.7877   |
| 0.4913        | 42.93  | 483  | 0.5386          | 0.7629   |
| 0.5216        | 44.0   | 495  | 0.4830          | 0.7827   |
| 0.4931        | 44.98  | 506  | 0.4692          | 0.7948   |
| 0.4835        | 45.96  | 517  | 0.4941          | 0.7757   |
| 0.5035        | 46.93  | 528  | 0.4716          | 0.7884   |
| 0.5068        | 48.0   | 540  | 0.5210          | 0.7841   |
| 0.5207        | 48.98  | 551  | 0.4656          | 0.8132   |
| 0.4753        | 49.96  | 562  | 0.4529          | 0.8025   |
| 0.4718        | 50.93  | 573  | 0.4403          | 0.8075   |
| 0.4757        | 52.0   | 585  | 0.4305          | 0.8132   |
| 0.4352        | 52.98  | 596  | 0.4104          | 0.8245   |
| 0.4349        | 53.96  | 607  | 0.4390          | 0.8125   |
| 0.4508        | 54.93  | 618  | 0.4409          | 0.8011   |
| 0.4596        | 56.0   | 630  | 0.4131          | 0.8323   |
| 0.4321        | 56.98  | 641  | 0.4257          | 0.8188   |
| 0.4433        | 57.96  | 652  | 0.4421          | 0.7969   |
| 0.4423        | 58.93  | 663  | 0.4430          | 0.7990   |
| 0.446         | 60.0   | 675  | 0.4328          | 0.8181   |
| 0.425         | 60.98  | 686  | 0.4385          | 0.8011   |
| 0.4363        | 61.96  | 697  | 0.4225          | 0.8139   |
| 0.4358        | 62.93  | 708  | 0.4114          | 0.8224   |
| 0.415         | 64.0   | 720  | 0.4110          | 0.8174   |
| 0.423         | 64.98  | 731  | 0.4090          | 0.8238   |
| 0.4161        | 65.96  | 742  | 0.4011          | 0.8160   |
| 0.4103        | 66.93  | 753  | 0.4207          | 0.8188   |
| 0.4254        | 68.0   | 765  | 0.4503          | 0.8004   |
| 0.429         | 68.98  | 776  | 0.4392          | 0.8033   |
| 0.4341        | 69.96  | 787  | 0.4159          | 0.8209   |
| 0.4574        | 70.93  | 798  | 0.4165          | 0.8224   |
| 0.4136        | 72.0   | 810  | 0.3954          | 0.8337   |
| 0.4226        | 72.98  | 821  | 0.3996          | 0.8301   |
| 0.4124        | 73.96  | 832  | 0.4205          | 0.8089   |
| 0.4209        | 74.93  | 843  | 0.4288          | 0.8146   |
| 0.4493        | 76.0   | 855  | 0.4193          | 0.8167   |
| 0.4302        | 76.98  | 866  | 0.4239          | 0.8132   |
| 0.4385        | 77.96  | 877  | 0.4187          | 0.8160   |
| 0.4388        | 78.93  | 888  | 0.4379          | 0.8047   |
| 0.4294        | 80.0   | 900  | 0.4048          | 0.8309   |
| 0.4207        | 80.98  | 911  | 0.4287          | 0.8139   |
| 0.4316        | 81.96  | 922  | 0.4183          | 0.8202   |
| 0.4283        | 82.93  | 933  | 0.4091          | 0.8224   |
| 0.4227        | 84.0   | 945  | 0.4070          | 0.8231   |
| 0.4335        | 84.98  | 956  | 0.4184          | 0.8224   |
| 0.4433        | 85.96  | 967  | 0.4148          | 0.8132   |
| 0.4287        | 86.93  | 978  | 0.4188          | 0.8167   |
| 0.4327        | 88.0   | 990  | 0.4091          | 0.8224   |
| 0.427         | 88.98  | 1001 | 0.4118          | 0.8202   |
| 0.4194        | 89.96  | 1012 | 0.4220          | 0.8153   |
| 0.4213        | 90.93  | 1023 | 0.4195          | 0.8096   |
| 0.4288        | 92.0   | 1035 | 0.4023          | 0.8188   |
| 0.4123        | 92.98  | 1046 | 0.4005          | 0.8393   |
| 0.4172        | 93.96  | 1057 | 0.3812          | 0.8309   |
| 0.4109        | 94.93  | 1068 | 0.3838          | 0.8294   |
| 0.4128        | 96.0   | 1080 | 0.3878          | 0.8294   |
| 0.3976        | 96.98  | 1091 | 0.4023          | 0.8259   |
| 0.4097        | 97.96  | 1102 | 0.3979          | 0.8153   |
| 0.4059        | 98.93  | 1113 | 0.3953          | 0.8294   |
| 0.4011        | 100.0  | 1125 | 0.3804          | 0.8344   |
| 0.4126        | 100.98 | 1136 | 0.3915          | 0.8259   |
| 0.425         | 101.96 | 1147 | 0.4140          | 0.8160   |
| 0.4066        | 102.93 | 1158 | 0.4207          | 0.8238   |
| 0.4265        | 104.0  | 1170 | 0.4016          | 0.8259   |
| 0.4225        | 104.98 | 1181 | 0.4059          | 0.8252   |
| 0.4201        | 105.96 | 1192 | 0.3980          | 0.8309   |
| 0.408         | 106.93 | 1203 | 0.4171          | 0.8202   |
| 0.422         | 108.0  | 1215 | 0.4475          | 0.8096   |
| 0.4251        | 108.98 | 1226 | 0.4139          | 0.8224   |
| 0.4261        | 109.96 | 1237 | 0.4113          | 0.8167   |
| 0.4147        | 110.93 | 1248 | 0.4355          | 0.8089   |
| 0.4407        | 112.0  | 1260 | 0.4453          | 0.8146   |
| 0.4167        | 112.98 | 1271 | 0.3987          | 0.8372   |
| 0.4152        | 113.96 | 1282 | 0.4008          | 0.8273   |
| 0.3952        | 114.93 | 1293 | 0.3843          | 0.8351   |
| 0.4159        | 116.0  | 1305 | 0.3949          | 0.8330   |
| 0.4014        | 116.98 | 1316 | 0.4113          | 0.8040   |
| 0.4203        | 117.96 | 1327 | 0.3988          | 0.8309   |
| 0.4159        | 118.93 | 1338 | 0.4037          | 0.8351   |
| 0.4065        | 120.0  | 1350 | 0.3847          | 0.8393   |
| 0.3938        | 120.98 | 1361 | 0.4023          | 0.8280   |
| 0.4202        | 121.96 | 1372 | 0.4015          | 0.8301   |
| 0.4316        | 122.93 | 1383 | 0.4156          | 0.8174   |
| 0.416         | 124.0  | 1395 | 0.3924          | 0.8344   |
| 0.4141        | 124.98 | 1406 | 0.3839          | 0.8358   |
| 0.4157        | 125.96 | 1417 | 0.3940          | 0.8224   |
| 0.3906        | 126.93 | 1428 | 0.3826          | 0.8287   |
| 0.4051        | 128.0  | 1440 | 0.3807          | 0.8316   |
| 0.3835        | 128.98 | 1451 | 0.3866          | 0.8386   |
| 0.3976        | 129.96 | 1462 | 0.3832          | 0.8457   |
| 0.3939        | 130.93 | 1473 | 0.3745          | 0.8351   |
| 0.3862        | 132.0  | 1485 | 0.3897          | 0.8408   |
| 0.3919        | 132.98 | 1496 | 0.3841          | 0.8429   |
| 0.3928        | 133.96 | 1507 | 0.3744          | 0.8507   |
| 0.3976        | 134.93 | 1518 | 0.3610          | 0.8535   |
| 0.3834        | 136.0  | 1530 | 0.3711          | 0.8422   |
| 0.3827        | 136.98 | 1541 | 0.3860          | 0.8422   |
| 0.4036        | 137.96 | 1552 | 0.3973          | 0.8301   |
| 0.3862        | 138.93 | 1563 | 0.3720          | 0.8429   |
| 0.3876        | 140.0  | 1575 | 0.3701          | 0.8478   |
| 0.3941        | 140.98 | 1586 | 0.3579          | 0.8500   |
| 0.3692        | 141.96 | 1597 | 0.3609          | 0.8521   |
| 0.3791        | 142.93 | 1608 | 0.3666          | 0.8493   |
| 0.3774        | 144.0  | 1620 | 0.3601          | 0.8521   |
| 0.3708        | 144.98 | 1631 | 0.3592          | 0.8549   |
| 0.3943        | 145.96 | 1642 | 0.3593          | 0.8493   |
| 0.3856        | 146.93 | 1653 | 0.3686          | 0.8429   |
| 0.381         | 148.0  | 1665 | 0.3755          | 0.8429   |
| 0.3965        | 148.98 | 1676 | 0.3698          | 0.8471   |
| 0.3862        | 149.96 | 1687 | 0.3641          | 0.8485   |
| 0.3825        | 150.93 | 1698 | 0.3652          | 0.8528   |
| 0.3751        | 152.0  | 1710 | 0.3672          | 0.8422   |
| 0.3812        | 152.98 | 1721 | 0.3626          | 0.8507   |
| 0.3805        | 153.96 | 1732 | 0.3615          | 0.8493   |
| 0.3755        | 154.93 | 1743 | 0.3678          | 0.8500   |
| 0.3802        | 156.0  | 1755 | 0.3682          | 0.8478   |
| 0.3781        | 156.98 | 1766 | 0.3802          | 0.8485   |
| 0.3845        | 157.96 | 1777 | 0.3753          | 0.8507   |
| 0.3893        | 158.93 | 1788 | 0.3694          | 0.8485   |
| 0.3676        | 160.0  | 1800 | 0.3652          | 0.8493   |
| 0.4114        | 160.98 | 1811 | 0.4020          | 0.8309   |
| 0.39          | 161.96 | 1822 | 0.3615          | 0.8528   |
| 0.3831        | 162.93 | 1833 | 0.3570          | 0.8535   |
| 0.3651        | 164.0  | 1845 | 0.3642          | 0.8401   |
| 0.3662        | 164.98 | 1856 | 0.3557          | 0.8577   |
| 0.3878        | 165.96 | 1867 | 0.3650          | 0.8457   |
| 0.376         | 166.93 | 1878 | 0.3601          | 0.8500   |
| 0.3724        | 168.0  | 1890 | 0.3617          | 0.8570   |
| 0.3661        | 168.98 | 1901 | 0.3677          | 0.8535   |
| 0.3869        | 169.96 | 1912 | 0.3617          | 0.8500   |
| 0.3717        | 170.93 | 1923 | 0.3594          | 0.8436   |
| 0.3698        | 172.0  | 1935 | 0.3632          | 0.8514   |
| 0.3761        | 172.98 | 1946 | 0.3614          | 0.8471   |
| 0.3847        | 173.96 | 1957 | 0.3566          | 0.8535   |
| 0.3716        | 174.93 | 1968 | 0.3570          | 0.8528   |
| 0.3695        | 176.0  | 1980 | 0.3557          | 0.8556   |
| 0.3702        | 176.98 | 1991 | 0.3544          | 0.8556   |
| 0.372         | 177.96 | 2002 | 0.3522          | 0.8542   |
| 0.3648        | 178.93 | 2013 | 0.3562          | 0.8493   |
| 0.3744        | 180.0  | 2025 | 0.3577          | 0.8507   |
| 0.3546        | 180.98 | 2036 | 0.3524          | 0.8535   |
| 0.3613        | 181.96 | 2047 | 0.3478          | 0.8528   |
| 0.3581        | 182.93 | 2058 | 0.3534          | 0.8549   |
| 0.3709        | 184.0  | 2070 | 0.3637          | 0.8521   |
| 0.3699        | 184.98 | 2081 | 0.3544          | 0.8549   |
| 0.3701        | 185.96 | 2092 | 0.3506          | 0.8613   |
| 0.3634        | 186.93 | 2103 | 0.3559          | 0.8592   |
| 0.3668        | 188.0  | 2115 | 0.3510          | 0.8585   |
| 0.3629        | 188.98 | 2126 | 0.3485          | 0.8592   |
| 0.3544        | 189.96 | 2137 | 0.3478          | 0.8627   |
| 0.3714        | 190.93 | 2148 | 0.3512          | 0.8592   |
| 0.3681        | 192.0  | 2160 | 0.3522          | 0.8592   |
| 0.3466        | 192.98 | 2171 | 0.3523          | 0.8570   |
| 0.3727        | 193.96 | 2182 | 0.3504          | 0.8606   |
| 0.3564        | 194.93 | 2193 | 0.3501          | 0.8577   |
| 0.3616        | 195.56 | 2200 | 0.3499          | 0.8577   |


### Framework versions

- Transformers 4.39.0
- Pytorch 2.2.1+cu121
- Datasets 2.18.0
- Tokenizers 0.15.2