dalle-mini / README.md
boris's picture
docs: add link to requirements
b7b2e31 unverified
metadata
title: DALL·E mini
emoji: 🥑
colorFrom: red
colorTo: purple
sdk: streamlit
app_file: app/app.py
pinned: false

DALL·E Mini

Generate images from a text prompt

Our logo was generated with DALL·E mini using the prompt "logo of an armchair in the shape of an avocado".

You can create your own pictures with the demo (temporarily in beta on Huging Face Spaces but soon to be open to all).

How does it work?

Refer to our report.

Development

Dependencies Installation

The root folder and associated requirements.txt is only for the app.

For development, use dev/requirements.txt or dev/environment.yaml.

Training of VQGAN

The VQGAN was trained using taming-transformers.

We recommend using the latest version available.

Conversion of VQGAN to JAX

Use patil-suraj/vqgan-jax.

Training of Seq2Seq

Refer to dev/seq2seq folder.

You can also adjust the sweep configuration file if you need to perform a hyperparameter search.

Inference Pipeline

To generate sample predictions and understand the inference pipeline step by step, refer to dev/inference/inference_pipeline.ipynb.

Open In Colab

Where does the logo come from?

The "armchair in the shape of an avocado" was used by OpenAI when releasing DALL·E to illustrate the model's capabilities. Having successful predictions on this prompt represents a big milestone to us.

Authors

Acknowledgements