pinned: false
---

# All Things ViTs: Understanding and Interpreting Attention in Vision (CVPR'23 tutorial)

*By: [Hila Chefer](https://hila-chefer.github.io) and [Sayak Paul](https://sayak.dev)*

*Website: [atv.github.io](https://atv.github.io)*

*Abstract: In this tutorial, we explore different ways to leverage attention in vision. From left to right: (i) attention can be used to explain the predictions made by the model (e.g., CLIP for an image-text pair); (ii) by manipulating the attention-based explainability maps, one can enforce that the prediction is made for the right reasons (e.g., foreground vs. background); (iii) the cross-attention maps of multi-modal models can be used to guide generative models (e.g., mitigating neglect in Stable Diffusion).*
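As a taste of theme (i), below is a minimal sketch (not part of the tutorial materials) of pulling raw attention maps out of a ViT with 🤗 Transformers. The checkpoint, input image, and post-processing choices here are illustrative assumptions:

```python
# Minimal sketch (illustrative, not from the tutorial) of extracting raw
# attention maps from a ViT using Hugging Face Transformers.
import torch
from PIL import Image
from transformers import ViTImageProcessor, ViTModel

# Illustrative checkpoint; any ViT checkpoint works the same way.
ckpt = "google/vit-base-patch16-224-in21k"
processor = ViTImageProcessor.from_pretrained(ckpt)
model = ViTModel.from_pretrained(ckpt)

image = Image.open("example.jpg")  # hypothetical input image
inputs = processor(images=image, return_tensors="pt")

with torch.no_grad():
    # `output_attentions=True` returns one tensor per layer with shape
    # (batch, num_heads, seq_len, seq_len), where seq_len = 1 (CLS) + patches.
    outputs = model(**inputs, output_attentions=True)

# CLS-token attention over the image patches in the last layer, averaged
# across heads — a simple (if crude) attention-based explanation map.
cls_attn = outputs.attentions[-1][0, :, 0, 1:].mean(dim=0)
num_patches = int(cls_attn.numel() ** 0.5)
attn_map = cls_attn.reshape(num_patches, num_patches)  # 14x14 for 224px/16
```

Raw attention like this is only a baseline; the attention-based explainability methods covered in the tutorial refine such maps into more faithful explanations.
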
This organization hosts all the interactive demos to be presented at the tutorial. Below, you can find some of them.