metadata
license: apache-2.0
tags:
  - trl
  - ddpo
  - diffusers
  - reinforcement-learning
  - text-to-image
  - stable-diffusion

TRL DDPO Model

This is a diffusion model fine-tuned with reinforcement learning using DDPO (Denoising Diffusion Policy Optimization) in TRL, so that its outputs are steered by a value function or human feedback. It can be used for text-conditioned image generation.
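
The card does not say how to load the model, so the following is only a minimal sketch of text-to-image inference with `diffusers`. It assumes the repository holds a complete Stable Diffusion pipeline. If only LoRA weights were pushed, the base model would have to be loaded first and the adapter applied on top. `REPO_ID`, `build_generation_kwargs`, and `generate_image` are hypothetical names introduced for illustration, and the step count and guidance scale are common defaults rather than values taken from this model's training run.

```python
from typing import Any, Dict

# Hypothetical placeholder: the card does not state the repository id.
REPO_ID = "<user>/<ddpo-model>"


def build_generation_kwargs(
    prompt: str, steps: int = 50, guidance_scale: float = 7.5
) -> Dict[str, Any]:
    """Collect the sampling arguments passed to the pipeline call."""
    if not prompt.strip():
        raise ValueError("prompt must be non-empty")
    return {
        "prompt": prompt,
        "num_inference_steps": steps,
        "guidance_scale": guidance_scale,
    }


def generate_image(repo_id: str, prompt: str, **overrides: Any):
    """Load the pipeline from the Hub and return one PIL image.

    Assumption: the repo contains a full Stable Diffusion pipeline,
    not just LoRA adapter weights.
    """
    # Imported lazily so the helper above works without torch/diffusers.
    import torch
    from diffusers import StableDiffusionPipeline

    device = "cuda" if torch.cuda.is_available() else "cpu"
    # Half precision is only used on GPU; CPU inference stays in float32.
    dtype = torch.float16 if device == "cuda" else torch.float32

    pipe = StableDiffusionPipeline.from_pretrained(repo_id, torch_dtype=dtype)
    pipe = pipe.to(device)

    kwargs = build_generation_kwargs(prompt, **overrides)
    return pipe(**kwargs).images[0]


# Example call (downloads weights; replace REPO_ID with the real repo id):
# image = generate_image(REPO_ID, "a photo of a squirrel", steps=30)
# image.save("sample.png")
```

Keeping the sampling arguments in a separate helper makes it easy to compare outputs under identical settings, which is useful when checking whether the reward-driven fine-tuning changed generations relative to the base model.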