Generates images from input text. These models can be used to generate and modify images based on text prompts.


About Text-to-Image

Use Cases

Data Generation

Businesses can generate data for their use cases by providing text prompts and collecting the images the model outputs.
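As a rough illustration of the data generation use case, the sketch below builds a grid of prompts and runs each one through an image generator. The helper names (`build_prompts`, `generate_dataset`, `load_diffusers_generator`) and the prompt template are hypothetical and not part of any library. The optional loader assumes the `diffusers` package is installed and that `model_id` points to a text-to-image checkpoint. The demo at the bottom uses a stand-in generator so it runs without downloading a model.

```python
from itertools import product
from typing import Callable, Iterable, List, Tuple


def build_prompts(subjects: Iterable[str], styles: Iterable[str]) -> List[str]:
    """Combine every subject with every style into one text prompt."""
    return [f"a {style} of a {subject}" for subject, style in product(subjects, styles)]


def generate_dataset(
    prompts: Iterable[str], generate: Callable[[str], object]
) -> List[Tuple[str, object]]:
    """Run each prompt through `generate` and keep (prompt, image) pairs."""
    return [(prompt, generate(prompt)) for prompt in prompts]


def load_diffusers_generator(model_id: str) -> Callable[[str], object]:
    """Wrap a diffusers pipeline as a prompt -> image callable.

    Assumption: `diffusers` is installed and `model_id` names a
    text-to-image checkpoint; neither is required for the demo below.
    """
    from diffusers import DiffusionPipeline  # lazy import: heavy optional dependency

    pipe = DiffusionPipeline.from_pretrained(model_id)
    return lambda prompt: pipe(prompt).images[0]


if __name__ == "__main__":
    prompts = build_prompts(
        ["red sneaker", "leather boot"], ["studio photo", "pencil sketch"]
    )
    # Stand-in generator (a placeholder string instead of a real image).
    pairs = generate_dataset(prompts, generate=lambda p: f"<image for: {p}>")
    for prompt, image in pairs:
        print(prompt, "->", image)
```

To produce real images, the stand-in could be swapped for `load_diffusers_generator(...)` with a checkpoint of your choice. Passing the generator in as a callable keeps the prompt logic independent of any particular model.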

Immersive Conversational Chatbots

Chatbots become more immersive when they respond with contextual images based on the user's input.

Creative Ideas for the Fashion Industry

Different patterns can be generated to obtain unique pieces of fashion. Text-to-image models make it easier for designers to conceptualize a design before actually implementing it.

Architecture Industry

Architects can use the models to construct an environment based on the requirements of a floor plan. This can also include the furniture to be placed in that environment.

Task Variants

Useful Resources

This page was made possible thanks to the efforts of Ishan Dutta and Oğuz Akif.

Note A model that can be used to generate images based on text prompts. The DALL·E Mega model is the largest version of DALL·E Mini.

Note A latent text-to-image diffusion model capable of generating photo-realistic images given any text input.

Note RedCaps is a large-scale dataset of 12M image-text pairs collected from Reddit.

Note Conceptual Captions is a dataset consisting of ~3.3M images annotated with captions.

