dennisjooo
/

emotion_classification

Image Classification

Generated from Trainer

Inference Endpoints

Model card Files Files and versions Community

dennisjooo commited on Sep 13, 2023

Commit

170a155

·

1 Parent(s): 2409a1c

Update README.md

Files changed (1) hide show

README.md +20 -9

README.md CHANGED Viewed

@@ -4,7 +4,7 @@ base_model: google/vit-base-patch16-224-in21k
 tags:
 - generated_from_trainer
 datasets:
-- imagefolder
 metrics:
 - accuracy
 - precision
@@ -16,8 +16,8 @@ model-index:
       name: Image Classification
       type: image-classification
     dataset:
-      name: imagefolder
-      type: imagefolder
       config: default
       split: train
       args: default
@@ -38,7 +38,8 @@ should probably proofread and complete it, then remove this comment. -->
 # emotion_classification
-This model is a fine-tuned version of [google/vit-base-patch16-224-in21k](https://huggingface.co/google/vit-base-patch16-224-in21k) on the imagefolder dataset.
 It achieves the following results on the evaluation set:
 - Loss: 1.1031
 - Accuracy: 0.6312
@@ -47,15 +48,25 @@ It achieves the following results on the evaluation set:
 ## Model description
-More information needed
-## Intended uses & limitations
-More information needed
-## Training and evaluation data
-More information needed
 ## Training procedure

 tags:
 - generated_from_trainer
 datasets:
+- FastJobs/Visual_Emotional_Analysis
 metrics:
 - accuracy
 - precision
       name: Image Classification
       type: image-classification
     dataset:
+      name: FastJobs/Visual_Emotional_Analysis
+      type: FastJobs/Visual_Emotional_Analysis
       config: default
       split: train
       args: default
 # emotion_classification
+This model is a fine-tuned version of [google/vit-base-patch16-224-in21k](https://huggingface.co/google/vit-base-patch16-224-in21k)
+on the [FastJobs/Visual_Emotional_Analysis](https://huggingface.co/datasets/FastJobs/Visual_Emotional_Analysis) dataset.
 It achieves the following results on the evaluation set:
 - Loss: 1.1031
 - Accuracy: 0.6312
 ## Model description
+The Vision Transformer base version trained on ImageNet-21K released by Google.
+Further details can be found on their [repo]((https://huggingface.co/google/vit-base-patch16-224-in21k))
+## Training and evaluation data
+### Data Split
+Used a 4:1 ratio for training and development sets and a seed of 42.
+### Pre-processing Augmentation
+The main pre-processing phase for both training and evaluation includes:
+- Resizing to (224, 224, 3) because it uses ImageNet images to train the original model
+- Normalizing images using a mean and standard deviation of [0.5, 0.5, 0.5]
+Other than the aforementioned pre-processing, the training set was augmented using:
+- Random horizontal & vertical flip
+- Color jitter
+- Random resized crop
 ## Training procedure