Graphcore/vit-base-ipu

Dongsung committed on May 19, 2022

Commit 6468a89 • 1 Parent(s): dc139a4

Remove unnecessary " ' " in paper link

Files changed (1)

README.md

@@ -12,7 +12,7 @@ The Vision Transformer (ViT) is a model for image recognition that employs a Tra
 It uses a standard Transformer encoder as used in NLP and simple, yet scalable, strategy works surprisingly well when coupled with pre-training on large amounts of dataset and tranferred to multiple size image recognition benchmarks while requiring substantially fewer computational resources to train.
-Paper link : [AN IMAGE IS WORTH 16X16 WORDS:TRANSFORMERS FOR IMAGE RECOGNITION AT SCALE'](https://arxiv.org/pdf/2010.11929.pdf)
+Paper link : [AN IMAGE IS WORTH 16X16 WORDS:TRANSFORMERS FOR IMAGE RECOGNITION AT SCALE](https://arxiv.org/pdf/2010.11929.pdf)
 ## Usage