Apply for community grant: Personal project (gpu)

#1
by shenyunhang - opened

We present APE, a universal visual perception model for aligning and prompting everything all at once in an image to perform diverse tasks, i.e., detection, segmentation, and grounding, as an instance-level sentence-object matching paradigm. APE aligns vision and language representation on broad data with natural and challenging characteristics all at once without task-specific fine-tuning. The extensive experiments on over 160 datasets demonstrate that, with only one-suit of weights, APE outperforms (or is on par with) the state-of-the-art models, proving that an effective yet universal perception for anything aligning and prompting is indeed feasible.

Arxiv page: https://arxiv.org/abs/2312.02153
Project page: https://github.com/shenyunhang/APE

image.png

example_1.png

Hi @shenyunhang , we have assigned a gpu to this space. Note that GPU Grants are provided temporarily and might be removed after some time if the usage is very low.

To learn more about GPUs in Spaces, please check out https://huggingface.co/docs/hub/spaces-gpus

Hi @hysts , thank you very much, it helps a lot.

Sign up or log in to comment