ViTSTR small v1.0

ViTSTR model pre-trained on various real STR datasets at image size 224x224 with a patch size of 16x16.

Disclaimer: this model card was not written by the original author.

Model description

TODO

Intended uses & limitations

You can use the model for STR on images containing Latin characters (62 case-sensitive alphanumeric + 32 punctuation marks).

How to use

TODO

BibTeX entry and citation info

@InProceedings{atienza2021vision,
  title={Vision transformer for fast and efficient scene text recognition},
  author={Atienza, Rowel},
  booktitle={International Conference on Document Analysis and Recognition},
  pages={319--334},
  year={2021},
  organization={Springer}
}
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.
The model cannot be deployed to the HF Inference API: The model has no library tag.