# Masked Autoencoders Are Scalable Vision Learners
Session by [johko](https://github.com/johko)
## Recording
[YouTube](https://www.youtube.com/watch?v=AC6flxUFLrg&pp=ygUdaHVnZ2luZyBmYWNlIHN0dWR5IGdyb3VwIHN3aW4%3D)
## Session Slides
[Google Drive](https://docs.google.com/presentation/d/10ZZ-Rl1D57VX005a58OmqNeOB6gPnE54/edit?usp=sharing&ouid=107717747412022342990&rtpof=true&sd=true)
## Original Paper
[Hugging Face](https://huggingface.co/papers/2111.06377) /
[arXiv](https://arxiv.org/abs/2111.06377)
## GitHub Repo
https://github.com/facebookresearch/mae
## Additional Resources
- [Transformers Docs ViTMAE](https://huggingface.co/docs/transformers/model_doc/vit_mae)
- [Transformers ViTMAE Demo Notebook](https://github.com/NielsRogge/Transformers-Tutorials/tree/master/ViTMAE) by Niels Rogge