metadata
base_model:
  - timm/swin_s3_base_224.ms_in1k
tags:
  - image-classification
  - timm
library_name: timm
license: apache-2.0
datasets:
  - food101
metrics:
  - accuracy

Model card for swin_s3_base_224-Foods-101

Model Details

Model Name: Swin Transformer (swin_s3_base_224)

Architecture: Swin Transformer

Pre-trained Model: Swin-S3 Base (timm/swin_s3_base_224.ms_in1k)

Fine-tuning Dataset: Food-101

Model Description

This model is a fine-tuned version of timm/swin_s3_base_224.ms_in1k, a Swin Transformer variant, trained on the Food-101 dataset. The Swin Transformer is a hierarchical vision transformer: it computes self-attention inside local windows that shift between consecutive layers, and it merges patches between stages to build multi-scale feature maps. This keeps the attention cost linear in image size while still letting information flow across the whole image.

The pre-trained model was fine-tuned on Food-101, which consists of 101 food categories (101,000 images, 1,000 per category).

Intended Use

This fine-tuned model classifies food images into one of the 101 categories of the Food-101 dataset. It is suited to applications such as food recognition, dietary analysis, and recipe recommendation.
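
A minimal inference sketch, assuming the weights load through timm's Hugging Face Hub integration. The repository id `OmAlve/swin_s3_base_224-Foods-101` is inferred from the card title and is an assumption, as are the helper names `top_k` and `classify_food`. The function returns class indices, not label names, because this card does not say whether the checkpoint ships a label mapping.

```python
from typing import List, Tuple

# Assumption: Hub repo id inferred from the card title; adjust if it differs.
REPO_ID = "hf_hub:OmAlve/swin_s3_base_224-Foods-101"


def top_k(probs: List[float], k: int = 5) -> List[Tuple[int, float]]:
    """Return the k (class_index, probability) pairs with the highest probability."""
    ranked = sorted(enumerate(probs), key=lambda pair: pair[1], reverse=True)
    return ranked[:k]


def classify_food(image_path: str, k: int = 5) -> List[Tuple[int, float]]:
    """Classify one image and return its top-k (class_index, probability) pairs."""
    # Heavy imports stay local so top_k() can be used without them installed.
    import timm
    import torch
    from PIL import Image

    model = timm.create_model(REPO_ID, pretrained=True).eval()

    # Build the preprocessing pipeline (resize, crop, normalize) from the
    # model's own pretrained config rather than hard-coding values.
    data_config = timm.data.resolve_model_data_config(model)
    transform = timm.data.create_transform(**data_config, is_training=False)

    image = Image.open(image_path).convert("RGB")
    batch = transform(image).unsqueeze(0)  # shape: (1, 3, H, W)

    with torch.no_grad():
        probs = model(batch).softmax(dim=-1)[0]  # shape: (num_classes,)

    return top_k(probs.tolist(), k)
```

Calling `classify_food("path/to/image.jpg")` (a placeholder path) downloads the weights on first use and returns the five most likely class indices with their probabilities. Mapping indices to Food-101 category names is left to the caller.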