--- tags: - vision --- Note: this model can only be used once https://github.com/huggingface/transformers/pull/29012 is merged