Commit b82f455, committed by cvetanovskaa
1 Parent(s): b61042f
Update README.md
README.md CHANGED
@@ -12,7 +12,7 @@ pipeline_tag: image-classification
 Vision Transformer (ViT) model pre-trained on ImageNet-21k (14 million images, 21,843 classes) at resolution 224x224. It was introduced in the paper [An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale](https://arxiv.org/abs/2010.11929) by Dosovitskiy et al. and first released in [this repository](https://github.com/google-research/vision_transformer). Fine-tuned on the German Traffic Sign Recognition Benchmark Dataset.
 
 ## Model description
-- Model Architecture: Vision Transformer (ViT) - google/vit-base-patch16-224.
+- Model Architecture: Vision Transformer (ViT) - google/vit-base-patch16-224-21k.
 - Fine-tuning Objective: Classify traffic signs into 43 different categories, including various speed limits, warning signs, and prohibitory or regulatory signs.
 - Developer: Aleksandra Cvetanovska
 
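
The `patch16-224` part of the architecture name describes the input geometry: a 224x224 image cut into 16x16 patches. As a minimal sketch of that arithmetic (the function name and constant below are illustrative and not part of the model card or any library), the token sequence length the encoder sees can be computed like this, assuming the standard ViT layout with one extra class token:

```python
def vit_sequence_length(image_size: int = 224,
                        patch_size: int = 16,
                        class_token: bool = True) -> int:
    """Return the number of tokens a ViT encoder receives for a square image.

    Hypothetical helper for illustration only; defaults mirror the
    patch16-224 configuration named in the model description.
    """
    if image_size % patch_size != 0:
        raise ValueError("image_size must be divisible by patch_size")
    patches_per_side = image_size // patch_size   # 224 // 16 = 14
    num_patches = patches_per_side ** 2           # 14 * 14 = 196
    # The class token is prepended to the patch tokens for classification.
    return num_patches + (1 if class_token else 0)


# Number of output categories stated in the fine-tuning objective.
NUM_TRAFFIC_SIGN_CLASSES = 43

if __name__ == "__main__":
    print(vit_sequence_length())  # 197
```

The classification head fine-tuned on the traffic sign data would then map the class-token representation to `NUM_TRAFFIC_SIGN_CLASSES` logits, one per sign category.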