longlian/llm-grounded-diffusion · Apply for community grant: Personal project

Owner Jun 12, 2023

•

edited Jun 12, 2023

LLM-grounded Diffusion is a personal project that enables LLM grounding to diffusion models (https://llm-grounded-diffusion.github.io/).

LLM-grounded Diffusion allows enhanced prompt understanding for text-to-image generation models such as stable diffusion. Our method significantly improve the prompt understanding compared to baseline SD and allow multi-round dialog-based text-to-image generation and generation with prompts in languages that SD doesn't support.

We already have a working local version of the space that includes the full pipeline (both text-to-layout and layout-to-image generation). However, due to a limited personal budget (as I'm a PhD student) and the fact that image generation is computationally intensive, I'm not able to make the layout-to-image part work without a GPU accelerator.

In order to allow people in the community to try our model and compare with baseline image generation methods, I'm requesting an Ampere-series GPU accelerator to this space (for its support of FlashAttention). I hope the hf team could kindly understand the request.

Thanks for the support to this project and the hf space!

longlian

Owner Jun 13, 2023

@akhaliq Could you please take a look? Thanks!

akhaliq

Jun 15, 2023

Hi @longlian , we have assigned a gpu to this space. Note that GPU Grants are provided temporarily and might be removed after some time if the usage is very low.

To learn more about GPUs in Spaces, please check out https://huggingface.co/docs/hub/spaces-gpus

longlian

Owner Jun 15, 2023

Thank you AK!

longlian changed discussion status to closed Jun 15, 2023