Edit model card

ddpo-compressibility

This model was finetuned from Stable Diffusion v1-4 using DDPO and a reward function encouraging images that are JPEG-compressible. See the project website for more details.

The model was finetuned for 60 iterations with a batch size of 256 samples per iteration. During finetuning, it was prompted with all of the animals in the Imagenet-1000 categories (the first 398 categories), but it exhibits some generalization to other prompts.

Downloads last month
51