Model Card for vit-small-patch16-224-single-channel

A Vision Transformer (ViT) image model that takes single-channel input.

Model Details

This model is a variant of the vit_small_patch16_224.augreg_in21k_ft_in1k model from the timm library that takes a single input channel instead of three.
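As a rough sanity check on what the channel change means for model size, the sketch below counts parameters for a ViT-Small/16 layout (embedding width 384, 12 blocks, MLP width 1536, 224x224 input). The function name `vit_param_count` is hypothetical, and the choice of a pooling layer with no classification head is an assumption, not something this card states; it is simply the layout that lands on roughly 21.6M parameters in this calculation.

```python
def vit_param_count(in_chans, embed_dim=384, depth=12, mlp_dim=1536,
                    patch=16, img=224, with_pooler=True, num_classes=0):
    """Count parameters of a ViT encoder (hypothetical helper, for illustration)."""
    num_patches = (img // patch) ** 2                      # 14 * 14 = 196 patches
    # Patch embedding: a conv with kernel = stride = patch size, plus bias.
    patch_embed = patch * patch * in_chans * embed_dim + embed_dim
    cls_token = embed_dim
    pos_embed = (num_patches + 1) * embed_dim              # +1 for the CLS token
    # Per block: query, key, value and output projections, each with bias.
    attn = 4 * (embed_dim * embed_dim + embed_dim)
    mlp = (embed_dim * mlp_dim + mlp_dim) + (mlp_dim * embed_dim + embed_dim)
    norms = 2 * (2 * embed_dim)                            # two LayerNorms per block
    block = attn + mlp + norms
    final_norm = 2 * embed_dim
    # Assumption: a pooling layer (dense embed_dim -> embed_dim) and no classifier head.
    pooler = embed_dim * embed_dim + embed_dim if with_pooler else 0
    head = embed_dim * num_classes + num_classes if num_classes else 0
    return (patch_embed + cls_token + pos_embed + depth * block
            + final_norm + pooler + head)


one_channel = vit_param_count(in_chans=1)
three_channel = vit_param_count(in_chans=3)
print(one_channel)                  # 21616896
print(three_channel - one_channel)  # 196608
```

Under these assumptions, only the patch-embedding weights depend on the channel count, so dropping from three channels to one removes 2 x 16 x 16 x 384 = 196,608 parameters and leaves the rest of the network unchanged.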

The timm model was converted to a transformers model with a modified version of the convert_vit_timm_to_pytorch.py script; the modified script is included in this repository as convert.py.

Model size: 21.6M params (Safetensors, tensor type F32)