why vivit in huggingface has no factorized encoders and so on?

by tsaganshosg - opened Jul 19, 2024

Jul 19, 2024

Dear friends,
i find that the torch implementation here has no "factorized encoder", "factorized self-attention", ...
the implementation of patchifying here is just a simple "Joint Space-Time"
thank you!

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment