bluestarburst's picture
Upload folder using huggingface_hub
ddb7519
|
raw
history blame
4.4 kB
# Mass compressing images in a folder
## How it Works
This script will sweep a folder and compress all the images to a given total number of megapixels. Aspect ratio is not changed and nothing is cropped.
*Images will only be resized if they exceed the specified megapixel limit.* Images within the limit will not be resized.
This script will also correct issues with images having EXIF directives that rotate the image. Or, in other words, it will make sure the proper orientation is saved native to the output image as trainers may not respect EXIF rotation directives.
EXIF rotation correction will take place regardless of whether images are resized.
Note this script will not attempt to copy or move any ICC color profiles at this time. Trainers likely do not respect this anyway...
Defaults are 1.5 megapixels, output is WEBPb at "quality 95" which affects the compression ratio. 90-99 are sane values for quality. For the purposes of training stable diffusion, 1.5 megapixels is a good balance between quality and file size, and quality of 90-99 is as well.
If you are hoping to use massive training files in the future as tech advances, you may wish to change the --max_mp setting to a higher value, but for now 1.5MP is more than enough to last for a few more advances in the technology. Ultimately this is your choice. EveryDream trainer is built to handle multiple aspects, but if you want to use images for another trainer and will crop square, you may wish to use a higher value to make sure the images remain large after croppy, or consider cropping carefully first *before* running this script.
## Usage
usage: compress_img.py [-h] [--img_dir IMG_DIR] [--out_dir OUT_DIR]
[--max_mp MAX_MP] [--quality QUALITY] [--overwrite]
[--noresize] [--delete]
Compress images in a directory.
options:
-h, --help show this help message and exit
--img_dir IMG_DIR path to image directory (default: 'input')
--out_dir OUT_DIR path to output directory (default: IMG_DIR)
--max_mp MAX_MP maximum megapixels (default: 1.5)
--quality QUALITY save quality (default: 95, range: 0-100, suggested: 90+)
--overwrite overwrite files in output directory
--noresize do not resize, just fix orientation
--delete delete original files after processing
The most basic use will load images from the local `input` directory, scale and rotate all the images, then write them back to the same folder. Default size is 1.5 megapixels.
python scripts/compress_img.py
To specify the image source directory, specify the `--img_dir`:
python scripts/compress_img.py --img_dir Q:\big_images
To save compressed images to a different path, specify the `--out_dir`:
python scripts/compress_img.py --img_dir Q:\big_images --out_dir Q:\small_images
If a specific image already exists in the output path, **it will be skipped**. For example, if you run the script twice, existing `.webp` images in the output directory will be skipped entirely. To overwrite existing files, use the `--overwrite` directive:
python scripts/compress_img.py --img_dir Q:\big_images --overwrite
If you want to ensure no files are skipped, *without overwriting existing images,* use `--out_dir` to specify an empty output folder.
If you want to delete the *original source image* after it has been resized, use the `--delete` directive:
python scripts/compress_img.py --img_dir Q:\big_images --delete
The `--delete` directive will not delete the original if it was overwritten or skipped.
To change the max megapixels, use the `--max_mp` option. For example, to set max megapixels to 2.0 and overwrite existing images in the output directory, see this example:
python scripts/compress_img.py --img_dir Q:\big_images --out_dir Q:\small_images --max_mp 2.0 --overwrite
Once you are comfortable with what is going on and OK with removing original images, you can use this to just replace everything in-place (these are my preferred settings):
python scripts/compress_img.py --img_dir Q:\big_images --max_mp 1.5 --quality 99 --overwrite --delete
This will compress all images in the `Q:\big_images` directory down to a maximum `1.5` megapixels, at quality `99`, and will `overwrite` any existing output images and `delete` the original, un-altered image.