Submitted by akhaliq · Mobile V-MoEs: Scaling Down Vision Transformers via Sparse Mixture-of-Experts · 10 authors