hula07
/

classify_images

Model card Files Files and versions Community

hula07 commited on May 18, 2023

Commit

00e44e8

•

1 Parent(s): f3ddde5

Update README.md

Files changed (1) hide show

README.md +18 -3

README.md CHANGED Viewed

@@ -5,8 +5,6 @@ language_creators:
 - found
 language:
 - en
-license:
-- mit
 multilinguality:
 - monolingual
 size_categories:
@@ -17,4 +15,21 @@ task_categories:
 - image-classification
 task_ids:
 - multi-class-image-classification
----

 - found
 language:
 - en
 multilinguality:
 - monolingual
 size_categories:
 - image-classification
 task_ids:
 - multi-class-image-classification
+datasets:
+- cifar10
+---
+## Image Classification with Vision Transformer (ViT)
+This repository contains a Python script for training an image classification model using the Vision Transformer (ViT) architecture. We use the transformers and datasets libraries from Hugging Face along with PyTorch and TensorFlow for the implementation.
+### Functions and Usage
+ * convert_to_tf_tensor(image: Image):
+ *  This function converts an image to a Tensorflow tensor with a size of 224x224 and three color channels.
+ * preprocess(batch):
+ * Preprocesses the images in a batch, using the feature extractor to convert them to pixel values. It also adds the labels to the batch.
+ * collate_fn(batch):
+ * This function prepares the batch for training or evaluation. It stacks the pixel values and labels.
+ * compute_metrics(p):
+ * Computes the metrics (accuracy) for the predictions.