---
license: apache-2.0
base_model: google/vit-base-patch16-224-in21k
tags:
- generated_from_trainer
datasets:
- imagefolder
metrics:
- accuracy
model-index:
- name: vit-dunham-carbonate-classifier
  results:
  - task:
      name: Image Classification
      type: image-classification
    dataset:
      name: imagefolder
      type: imagefolder
      config: default
      split: train
      args: default
    metrics:
    - name: Accuracy
      type: accuracy
      value: 0.8888888888888888
---

# vit-dunham-carbonate-classifier

## Model description

This model is a fine-tuned version of [google/vit-base-patch16-224-in21k](https://huggingface.co/google/vit-base-patch16-224-in21k) on Data S1 of [Lokier & Al Junaibi (2016)](https://onlinelibrary.wiley.com/doi/10.1111/sed.12293). The model captures the expertise of 177 volunteers from 33 countries, with a combined 3,270 years of academic and industry experience, who classified 14 carbonate thin-section samples using the classical [Dunham (1962)](https://en.wikipedia.org/wiki/Dunham_classification) carbonate classification.

![image/png](https://cdn-uploads.huggingface.co/production/uploads/64ff0bce56243ce8cb6df456/IXs0cK2sflvbCg5EJAiMo.png)

([Source](https://commons.wikimedia.org/wiki/File:Dunham_classification_EN.svg))

In the original paper, the authors set out to analyze objectively whether these volunteers apply the same standards when using the Dunham classification.

## Intended uses & limitations

- Input: a carbonate thin-section image, in either plane-polarized (PPL) or cross-polarized (XPL) light
- Output: a Dunham class (Mudstone/Wackestone/Packstone/Grainstone/Boundstone/Crystalline) and its probability
- Limitation: the original dataset contains no Boundstone sample, so the model cannot classify a Boundstone.

Sample image source: [Grainstone - Wikipedia](https://en.wikipedia.org/wiki/Grainstone)

![image/png](https://cdn-uploads.huggingface.co/production/uploads/64ff0bce56243ce8cb6df456/r4aBwewYuL-WLfTdqqFL-.png)

## Training and evaluation data

Source: [Lokier & Al Junaibi (2016), Data S1](https://onlinelibrary.wiley.com/action/downloadSupplement?doi=10.1111%2Fsed.12293&file=sed12293-sup-0001-SupInfo.zip)

The data consist of 14 samples. Each sample was photographed at 3 magnifications (x2, x4, and x10) in both PPL and XPL, giving 14 samples × 3 magnifications × 2 polarizations = 84 images in the training dataset. The class label for each sample is the most popular respondent answer in Table 7 of the paper:

- Sample 1: Packstone
- Sample 2: Grainstone
- Sample 3: Wackestone
- Sample 4: Packstone
- Sample 5: Wackestone
- Sample 6: Packstone
- Sample 7: Packstone
- Sample 8: Mudstone
- Sample 9: Crystalline
- Sample 10: Grainstone
- Sample 11: Wackestone
- Sample 12: Grainstone
- Sample 13: Grainstone
- Sample 14: Mudstone
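With the images arranged in the standard `imagefolder` layout (one sub-directory per Dunham class; the directory and file names below are illustrative assumptions, not the layout of the original archive), the dataset can be loaded with 🤗 Datasets roughly as follows. A ~10% hold-out of 84 images yields the 9-image validation set implied by the reported accuracies (e.g. 8/9 ≈ 0.8889):

```python
from datasets import load_dataset

# Assumed layout: data/<class_name>/<image>.jpg,
# e.g. data/packstone/sample01_x4_ppl.jpg (84 images in total).
dataset = load_dataset("imagefolder", data_dir="data")

# Hold out ~10% of the images for validation (9 of 84, since the
# test split size is rounded up, following scikit-learn semantics).
splits = dataset["train"].train_test_split(test_size=0.1, seed=42)
train_ds, eval_ds = splits["train"], splits["test"]

print(train_ds.num_rows, eval_ds.num_rows)  # 75 9
print(train_ds.features["label"].names)     # class names from folder names
```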
## Training procedure

### Training hyperparameters

The following hyperparameters were used during training (a sketch mapping these onto `TrainingArguments` appears after the framework versions below):
- learning_rate: 5e-05
- train_batch_size: 8
- eval_batch_size: 8
- seed: 42
- gradient_accumulation_steps: 2
- total_train_batch_size: 16
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: linear
- lr_scheduler_warmup_ratio: 0.1
- num_epochs: 20

### Training results

| Training Loss | Epoch | Step | Validation Loss | Accuracy |
|:-------------:|:-----:|:----:|:---------------:|:--------:|
| 1.5764        | 1.0   | 5    | 1.5329          | 0.4444   |
| 1.3991        | 2.0   | 10   | 1.4253          | 0.5556   |
| 1.2792        | 3.0   | 15   | 1.2851          | 0.7778   |
| 1.0119        | 4.0   | 20   | 1.1625          | 0.8889   |
| 0.9916        | 5.0   | 25   | 1.0471          | 0.8889   |
| 0.9202        | 6.0   | 30   | 0.9836          | 0.7778   |
| 0.6994        | 7.0   | 35   | 0.8649          | 0.8889   |
| 0.526         | 8.0   | 40   | 0.7110          | 1.0      |
| 0.5383        | 9.0   | 45   | 0.6127          | 1.0      |
| 0.5128        | 10.0  | 50   | 0.5337          | 1.0      |
| 0.4312        | 11.0  | 55   | 0.4887          | 1.0      |
| 0.3827        | 12.0  | 60   | 0.4365          | 1.0      |
| 0.3452        | 13.0  | 65   | 0.3891          | 1.0      |
| 0.3164        | 14.0  | 70   | 0.3677          | 1.0      |
| 0.2899        | 15.0  | 75   | 0.3555          | 1.0      |
| 0.2878        | 16.0  | 80   | 0.3197          | 1.0      |
| 0.2884        | 17.0  | 85   | 0.3056          | 1.0      |
| 0.2633        | 18.0  | 90   | 0.3107          | 1.0      |
| 0.2669        | 19.0  | 95   | 0.3164          | 1.0      |
| 0.2465        | 20.0  | 100  | 0.2949          | 1.0      |

### Framework versions

- Transformers 4.33.2
- Pytorch 2.0.1+cu118
- Datasets 2.14.5
- Tokenizers 0.13.3
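The hyperparameters listed above correspond roughly to the following `TrainingArguments`; this is a reconstruction for illustration, not the exact training script (the output directory name is an assumption):

```python
from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="vit-dunham-carbonate-classifier",  # assumed output directory
    learning_rate=5e-5,
    per_device_train_batch_size=8,
    per_device_eval_batch_size=8,
    gradient_accumulation_steps=2,  # effective train batch size: 8 * 2 = 16
    num_train_epochs=20,
    lr_scheduler_type="linear",
    warmup_ratio=0.1,
    seed=42,
    evaluation_strategy="epoch",    # evaluate once per epoch, as in the table above
    logging_strategy="epoch",
)
# The optimizer matches the Trainer default: AdamW with betas=(0.9, 0.999)
# and epsilon=1e-08.
```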
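As described under "Intended uses & limitations", the model takes a thin-section image and returns Dunham classes with probabilities. A minimal inference sketch with the 🤗 `pipeline` API (the model id below is a placeholder; substitute the actual Hub repo path or a local checkpoint directory):

```python
from transformers import pipeline

# Placeholder model id; replace with "<user>/vit-dunham-carbonate-classifier"
# or a local path to the fine-tuned checkpoint.
classifier = pipeline("image-classification", model="vit-dunham-carbonate-classifier")

# Accepts a file path, URL, or PIL.Image; PPL and XPL photos are both valid inputs.
for prediction in classifier("thin_section.jpg", top_k=5):
    print(f"{prediction['label']}: {prediction['score']:.3f}")
```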