Fine-tuning a Vision Transformer on the Big Cats Dataset In this project, we fine-tuned a vision transformer on the Big Cats dataset to perform image classification. The Big Cats dataset consists of 2339 images of 10 different types of big cats, including lions, tigers, jaguars, and more.

Our goal was to train a model that could accurately classify these images with high accuracy. After fine-tuning a pre-trained Vision Transformer, we were able to achieve an accuracy of 98%.

Kaggle dataset: https://www.kaggle.com/datasets/gpiosenka/cats-in-the-wild-image-classification

Downloads last month
18
Safetensors
Model size
85.8M params
Tensor type
F32
·
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.