ddpo-incompressibility

This model was finetuned from Stable Diffusion v1-4 using DDPO with a reward function that encourages images that are hard to compress with JPEG. See the project website for more details.
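To make the reward concrete, here is a minimal sketch of what an incompressibility reward could look like. It assumes the reward is simply the size of the image after JPEG encoding; the function names, the kilobyte unit, and the `quality=95` setting are illustrative assumptions, not taken from the actual training code, which may differ.

```python
import io
import random

from PIL import Image


def jpeg_size_kb(image: Image.Image, quality: int = 95) -> float:
    """Return the size of `image` in kilobytes after JPEG encoding.

    `quality` is an assumed setting, not one confirmed by the model card.
    """
    buffer = io.BytesIO()
    # JPEG has no alpha channel, so convert to RGB before encoding.
    image.convert("RGB").save(buffer, format="JPEG", quality=quality)
    return len(buffer.getvalue()) / 1000.0


def incompressibility_reward(image: Image.Image, quality: int = 95) -> float:
    """Hypothetical reward: larger JPEG output means higher reward."""
    return jpeg_size_kb(image, quality)


# A flat gray image compresses well; random noise does not.
flat = Image.new("RGB", (64, 64), (128, 128, 128))
rng = random.Random(0)
noise = Image.frombytes(
    "RGB", (64, 64), bytes(rng.randrange(256) for _ in range(64 * 64 * 3))
)

print("flat :", incompressibility_reward(flat))
print("noise:", incompressibility_reward(noise))
```

Under this sketch, images with dense high-frequency detail score higher than smooth ones, which is the kind of output a model optimized for this reward would be pushed toward.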

The model was finetuned for 20 iterations with a batch size of 256 samples per iteration. During finetuning, it was prompted with all of the animal classes in ImageNet-1000 (the first 398 categories), but it exhibits some generalization to other prompts.