dalle-mini / README.md
khalidsaifullaah's picture
comments removed
2b5c84e
|
raw
history blame
2.6 kB
metadata
title: Dalle Mini
emoji: 🥑
colorFrom: red
colorTo: blue
sdk: gradio
app_file: app/app_gradio_ngrok.py
pinned: false

DALL-E Mini

Generate images from a text prompt

Our logo was generated with DALL-E mini using the prompt "logo of an armchair in the shape of an avocado".

You can also create your own pictures with the demo (TODO: add link).

How does it work?

Refer to our report.

Development

This section is for the adventurous people wanting to look into the code.

Dependencies Installation

The root folder and associated requirements.txt is only for the app.

You will find necessary requirements in each sub-section.

You should create a new python virtual environment and install the project dependencies inside the virtual env. You need to use the -f (--find-links) option for pip to be able to find the appropriate libtpu required for the TPU hardware.

Adapt the installation to your own hardware and follow library installation instructions.

$ pip install -r requirements.txt -f https://storage.googleapis.com/jax-releases/libtpu_releases.html

If you use conda, you can create the virtual env and install everything using: conda env update -f environments.yaml

Training of VQGAN

The VQGAN was trained using taming-transformers.

We recommend using the latest version available.

Conversion of VQGAN to JAX

Use patil-suraj/vqgan-jax.

Training of Seq2Seq

Refer to seq2seq folder (some parameters may have been hardcoded for convenience when training on our TPU VM).

You can also adjust the sweep configuration file if you need to perform a hyperparameter search.

Inference

Refer to the demo notebooks. TODO: add links

Authors

Acknowledgements