How to obtain the sequence output of Visual Encoder?

by HeyQ8747 - opened

I wanted to get the sequence output from the Visual Encoder, but found that open_clip can only output cls token. I tried to rewrite the forward function myself but didn't get the same result.

Microsoft org

Hmm, it should be doable, but I'm not sure of the best way off the top of my head. Can you open an issue on the main open_clip repo and link here?

Sign up or log in to comment