Spaces:
Running
title: Dalle Mini
emoji: 🎨
colorFrom: red
colorTo: blue
sdk: gradio
app_file: app/app_gradio_ngrok.py
pinned: false
DALL-E Mini
Generate images from a text prompt
TODO: add some cool example
Create my own images with the demo → Coming soon
How does it work?
Refer to our report.
Development
This section is for the adventurous people wanting to look into the code.
Dependencies Installation
The root folder and associated requirements.txt
is only for the app.
You will find necessary requirements in each sub-section.
You should create a new python virtual environment and install the project dependencies inside the virtual env. You need to use the -f
(--find-links
) option for pip
to be able to find the appropriate libtpu
required for the TPU hardware.
Adapt the installation to your own hardware and follow library installation instructions.
$ pip install -r requirements.txt -f https://storage.googleapis.com/jax-releases/libtpu_releases.html
If you use conda
, you can create the virtual env and install everything using: conda env update -f environments.yaml
Training of VQGAN
The VQGAN was trained using taming-transformers.
We recommend using the latest version available.
Conversion of VQGAN to JAX
Training of Seq2Seq
Refer to seq2seq
folder (some parameters may have been hardcoded for convenience when training on our TPU VM).
You can also adjust the sweep configuration file if you need to perform a hyperparameter search.
Inference
Refer to the demo notebooks. TODO: add links
Authors
- Boris Dayma
- Suraj Patil
- Pedro Cuenca
- Khalid Saifullah
- Tanishq Abraham
- Phúc Lê Khắc
- Luke Melas
- Ritobrata Ghosh
Acknowledgements
- 🤗 Hugging Face for organizing the FLAX/JAX community week
- Google Cloud team for providing access to TPU's