Preprocessing Guide
Dataset download
We follow the preprocessing code used in edm2, with several edits: (1) we removed everything except preprocessing, because this code is used only for preprocessing; (2) we use the [-1, 1] range for the input to the Stable Diffusion VAE (similar to DiT or SiT), unlike edm2, which uses the [0, 1] range; and (3) we consider preprocessing to 256x256 resolution (or 512x512 resolution).
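To make the difference in edit (2) concrete, here is a minimal sketch of the two pixel-scaling conventions. It is not taken from the repository: the function names are hypothetical, and it uses plain Python lists for illustration, whereas real preprocessing would apply the same arithmetic to image arrays or tensors.

```python
def scale_to_unit(pixels):
    """Map 8-bit pixel values (0..255) to the [0, 1] range."""
    return [p / 255.0 for p in pixels]


def scale_to_symmetric(pixels):
    """Map 8-bit pixel values (0..255) to the [-1, 1] range."""
    return [p / 127.5 - 1.0 for p in pixels]


# Hypothetical example row of 8-bit pixel values.
row = [0, 128, 255]
unit = scale_to_unit(row)            # [0, 1] convention
symmetric = scale_to_symmetric(row)  # [-1, 1] convention
```

The two conventions are related by x_symmetric = 2 * x_unit - 1, so latents encoded under one convention should not be mixed with a pipeline that expects the other.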
After downloading ImageNet, run the following scripts (replace 256x256 with 512x512 if you want to run experiments at 512x512 resolution):
Convert raw ImageNet data to a ZIP archive at 256x256 resolution
bash dataset_prepare_encode.sh
Convert the pixel data to VAE latents
bash dataset_prepare_convert.sh
Here, YOUR_DOWNLOAD_PATH is the directory where you downloaded the dataset, and TARGET_PATH is the directory where the preprocessed images and the corresponding compressed latent vectors will be saved. This directory will be used by your experiment scripts.
Acknowledgement
This code is mainly built upon the edm2 repository.