Apply for community grant: Personal project (gpu)
This demo runs MambaOut, a model proposed in our paper "MambaOut: Do We Really Need Mamba for Vision?" Based on our conceptual discussion, we hypothesize that Mamba is unnecessary for image classification on ImageNet, while its potential for detection and segmentation remains worth exploring. To verify these hypotheses, we build MambaOut from Mamba blocks with their core token mixer, the SSM, removed. MambaOut outperforms all visual Mamba models on ImageNet classification but cannot match state-of-the-art visual Mamba models on detection and segmentation.
Paper: https://arxiv.org/abs/2405.07992
Code: https://github.com/yuweihao/MambaOut
Demo: https://huggingface.co/spaces/whyu/MambaOut
Thank you for your great help. I am struggling to debug it.
As you can see in the org card of the ZeroGPU explorers org, multiple ZeroGPU Spaces share the same hardware, so this is expected. However, the model is not offloaded to the CPU immediately; it stays on the GPU for a while after inference finishes, which minimizes overhead when many people are visiting the Space.
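To make the keep-warm idea concrete, here is a rough toy sketch in plain Python. It is not how ZeroGPU is actually implemented; `WarmSlot`, `load_fn`, and the 30-second `idle_timeout_s` are all hypothetical names and values chosen for illustration. It only shows why back-to-back requests avoid the reload cost while a request after a long idle gap pays it again.

```python
import time


class WarmSlot:
    """Toy model of a shared slot that keeps a model resident for a while.

    Illustration only: the names and the timeout are assumptions,
    not part of any real ZeroGPU API.
    """

    def __init__(self, load_fn, idle_timeout_s=30.0, clock=time.monotonic):
        self._load_fn = load_fn                # hypothetical: loads the model onto the device
        self._idle_timeout_s = idle_timeout_s  # hypothetical keep-warm window
        self._clock = clock                    # injectable clock, handy for testing
        self._model = None
        self._last_used = None
        self.load_count = 0                    # how many times we paid the load cost

    def _evict_if_idle(self):
        # Drop the model only if it has been unused longer than the window.
        if self._model is not None and self._clock() - self._last_used > self._idle_timeout_s:
            self._model = None

    def infer(self, x):
        self._evict_if_idle()
        if self._model is None:
            # Cold start: this is the overhead that keeping the model warm avoids.
            self._model = self._load_fn()
            self.load_count += 1
        result = self._model(x)
        self._last_used = self._clock()
        return result
```

With a slot created as `WarmSlot(load_fn)`, two calls to `infer` a few seconds apart trigger a single load, while a call made after the idle window has passed triggers a second one.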