File size: 818 Bytes
09b13b3
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
# How Do Vision Transformers Work
Session by [johko](https://github.com/johko)


## Session Slides πŸ–₯️
[Google Drive](https://docs.google.com/presentation/d/1PewOHVABkxx0jO9PoJSQi8to_WNlL4HdDp4M9e4L8hs/edit?usp=drivesdks)


## Original Paper πŸ“„
[Hugging Face](https://huggingface.co/papers/2202.06709) /
[arxiv](https://arxiv.org/pdf/2202.06709.pdf)


## GitHub Repo πŸ§‘πŸ½β€πŸ’»
https://github.com/microsoft/Swin-Transformer


## Additional Resources πŸ“š
Hessian Matrices:

- https://stackoverflow.com/questions/23297090/how-calculating-hessian-works-for-neural-network-learning
- https://machinelearningmastery.com/a-gentle-introduction-to-hessian-matrices/

Loss Landscape Visualization:

- https://mathformachines.com/posts/visualizing-the-loss-landscape/
- https://github.com/tomgoldstein/loss-landscape