Apply for community grant: Academic project

#1
by GMFTBY - opened

Project Description

PandaGPT is the first foundation model capable of instruction-following data across six modalities, without the need of explicit supervision. It demonstrates a diverse set of multimodal capabilities such as complex understanding/reasoning, knowledge-grounded description, and multi-turn conversation.

PandaGPT is a general-purpose instruction-following model that can both see ๐Ÿ‘€ and hear๐Ÿ‘‚. Our pilot experiments show that PandaGPT can perform complex tasks such as detailed image description generation, writing stories inspired by videos, and answering questions about audios. More Interestingly, PandaGPT can take multimodal inputs simultaneously and compose their semantics naturally. For example, PandaGPT can connect how objects look in a photo and how they sound in an audio.

Financial Situation

The major contributors of the project are mostly Ph.D. students. Unfortunately, due to financial constraints inherent to our status as students, self-funding such a critical resource is untenable. Therefore, we humbly request your consideration for the grant in question. With your support, we believe we can further contribute to the body of knowledge in our area of research, enhancing both our academic growth and potential societal impact.

For more details and use cases, please refer to our blog and github.

Hi @GMFTBY , we have assigned a gpu to this space. Note that GPU Grants are provided temporarily and might be removed after some time if the usage is very low.

To learn more about GPUs in Spaces, please check out https://huggingface.co/docs/hub/spaces-gpus

osanseviero changed discussion status to closed

Sign up or log in to comment