--- license: apache-2.0 tags: - trl - ddpo - diffusers - reinforcement-learning - text-to-image - stable-diffusion --- # TRL DDPO Model This is a diffusion model that has been fine-tuned with reinforcement learning to guide the model outputs according to a value, function, or human feedback. The model can be used for image generation conditioned with text.