Apply for community grant: Academic project (gpu and storage)

#1
by mucai - opened
Owner

We present Matryoshka Multimodal Models (M3), which represents visual tokens in a nested manner following the coarse-to-fine order. Now users can explicitly control the visual granularity per test instance during inference! It will be great to host this model in huggingface!
@akhaliq

teaser.png

Hi @mucai , we've assigned ZeroGPU to this Space. Please check the compatibility and usage sections of this page so your Space can run on ZeroGPU.

Owner

Huge thanks!

Sign up or log in to comment