---
license: openrail
---
Vision Transformers trained from scratch on MNIST: https://huggingface.co/datasets/mnist

Guide: https://medium.com/mlearning-ai/vision-transformers-from-scratch-pytorch-a-step-by-step-guide-96c3313c2e0c

ViT_Small: {"chw": (1, 28, 28), "n_patches": 7, "n_blocks": 4, "hidden_d": 8, "n_heads": 4, "out_d": 10} (23 kB, 2K+ steps)

ViT_Large: {"chw": (1, 28, 28), "n_patches": 7, "n_blocks": 6, "hidden_d": 64, "n_heads": 8, "out_d": 10} (881 kB, 20K+ steps)
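
To make the two configurations easier to read, here is a minimal sketch that derives the tensor shapes they imply. It assumes, as in the linked guide, that `n_patches` counts patches per image side and that one class token is prepended to the patch sequence. The helper `derived_shapes` and its output keys are hypothetical names introduced for illustration; they are not part of the released checkpoints.

```python
# Configurations copied from the model card above.
VIT_SMALL = {"chw": (1, 28, 28), "n_patches": 7, "n_blocks": 4,
             "hidden_d": 8, "n_heads": 4, "out_d": 10}
VIT_LARGE = {"chw": (1, 28, 28), "n_patches": 7, "n_blocks": 6,
             "hidden_d": 64, "n_heads": 8, "out_d": 10}


def derived_shapes(cfg):
    """Hypothetical helper: compute the shapes implied by a config.

    Assumes n_patches is the number of patches per side and that a
    single class token is added to the sequence (assumptions taken
    from the linked guide, not verified against the checkpoints).
    """
    c, h, w = cfg["chw"]
    n = cfg["n_patches"]
    if h % n or w % n:
        raise ValueError("image sides must be divisible by n_patches")
    if cfg["hidden_d"] % cfg["n_heads"]:
        raise ValueError("hidden_d must be divisible by n_heads")
    patch_h, patch_w = h // n, w // n
    return {
        "patch_size": (patch_h, patch_w),         # pixels per patch
        "patch_dim": c * patch_h * patch_w,       # flattened patch length
        "seq_len": n * n + 1,                     # patches plus class token
        "head_dim": cfg["hidden_d"] // cfg["n_heads"],
    }


if __name__ == "__main__":
    for name, cfg in (("ViT_Small", VIT_SMALL), ("ViT_Large", VIT_LARGE)):
        print(name, derived_shapes(cfg))
```

Under these assumptions both models split each 28x28 digit into 49 patches of 4x4 pixels; they differ only in depth (`n_blocks`), embedding width (`hidden_d`), and number of attention heads.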

![image/png](https://cdn-uploads.huggingface.co/production/uploads/646a8ef31556443f24b803be/Po6WNrl75QNXrLKBCQGng.png)
![image/png](https://cdn-uploads.huggingface.co/production/uploads/646a8ef31556443f24b803be/pvJdoLQtxdaiRx9wetwsR.png)
![image/png](https://cdn-uploads.huggingface.co/production/uploads/646a8ef31556443f24b803be/YXnktdx1JZwRx6q_ihp_M.png)
![image/png](https://cdn-uploads.huggingface.co/production/uploads/646a8ef31556443f24b803be/HmpyPsUY0fWzlsjcfjCyd.png)
![image/png](https://cdn-uploads.huggingface.co/production/uploads/646a8ef31556443f24b803be/DHGwqT-b8zi4mqhYIIUmk.png)