File size: 818 Bytes
b0ae254 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 |
# How Do Vision Transformers Work
Session by [johko](https://github.com/johko)
## Session Slides π₯οΈ
[Google Drive](https://docs.google.com/presentation/d/1PewOHVABkxx0jO9PoJSQi8to_WNlL4HdDp4M9e4L8hs/edit?usp=drivesdks)
## Original Paper π
[Hugging Face](https://huggingface.co/papers/2202.06709) /
[arxiv](https://arxiv.org/pdf/2202.06709.pdf)
## GitHub Repo π§π½βπ»
https://github.com/microsoft/Swin-Transformer
## Additional Resources π
Hessian Matrices:
- https://stackoverflow.com/questions/23297090/how-calculating-hessian-works-for-neural-network-learning
- https://machinelearningmastery.com/a-gentle-introduction-to-hessian-matrices/
Loss Landscape Visualization:
- https://mathformachines.com/posts/visualizing-the-loss-landscape/
- https://github.com/tomgoldstein/loss-landscape |