Data efficient Image Transformers.
DeiT small model pretrained on imagenet with 300 epochs:
DeiT small model finetuned on imagenet with 15 epochs:
-