Model Card for vit-small-patch16-224-single-channel

A Vision Transformer (ViT) image model that takes single-channel input instead of three-channel RGB.

Model Details

This model is a variant of the vit_small_patch16_224.augreg_in21k_ft_in1k model from the timm library that accepts a single input channel instead of three.

The timm model was converted to a transformers model using a modified version of the convert_vit_timm_to_pytorch.py script, which is included in this repository as convert.py.
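
As a rough illustration of what a single-channel ViT-Small looks like on the transformers side, the sketch below builds a randomly initialised ViTModel with num_channels=1 and runs a dummy batch through it. The configuration values (hidden size 384, 12 layers, 6 heads, MLP size 1536) are the standard ViT-Small/16 dimensions and are assumed here to mirror this repository's config; the sketch does not load the converted weights.

```python
import torch
from transformers import ViTConfig, ViTModel

# Assumed ViT-Small/16 dimensions; the repository's own config.json is authoritative.
config = ViTConfig(
    hidden_size=384,
    num_hidden_layers=12,
    num_attention_heads=6,
    intermediate_size=1536,
    image_size=224,
    patch_size=16,
    num_channels=1,  # single input channel instead of three
)

# Randomly initialised model, used only to show the expected tensor shapes.
model = ViTModel(config)
model.eval()

# Dummy batch of two single-channel 224x224 images: (batch, channels, height, width).
pixel_values = torch.randn(2, 1, 224, 224)

with torch.no_grad():
    outputs = model(pixel_values=pixel_values)

# 14 x 14 = 196 patches plus one [CLS] token gives 197 tokens of width 384.
print(outputs.last_hidden_state.shape)  # torch.Size([2, 197, 384])
```

To use the converted weights instead, ViTModel.from_pretrained can be pointed at this repository's identifier; the input must still have shape (batch, 1, 224, 224).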

Model size: 21.6M params (Safetensors, tensor type F32)