SeanForHim

Push model using huggingface_hub.

0d9d17b verified about 2 months ago

metadata

license: apache-2.0
tags:
  - trl
  - ddpo
  - diffusers
  - reinforcement-learning
  - text-to-image
  - stable-diffusion

TRL DDPO Model

This is a diffusion model that has been fine-tuned with reinforcement learning to steer its outputs according to a value function or human feedback. The model can be used for text-conditioned image generation.
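As a rough illustration of text-conditioned generation, the sketch below loads the checkpoint with the `diffusers` library and samples one image. It rests on several assumptions not stated in this card: that the repository holds a full Stable Diffusion pipeline (if only LoRA weights were pushed, the base model would need to be loaded first and the adapter applied with `load_lora_weights`), that `REPO_ID` is replaced with the real repository name (the value here is a hypothetical placeholder), and that the step count and guidance scale are reasonable defaults rather than the author's settings. The helper names `generation_kwargs` and `generate` are illustrative only.

```python
from typing import Any, Dict

# Hypothetical placeholder: this card does not state the repository ID.
REPO_ID = "<user>/<ddpo-model>"


def generation_kwargs(
    prompt: str, steps: int = 50, guidance_scale: float = 7.5
) -> Dict[str, Any]:
    """Collect sampling arguments; the default values are assumptions."""
    if not prompt.strip():
        raise ValueError("prompt must not be empty")
    return {
        "prompt": prompt,
        "num_inference_steps": steps,
        "guidance_scale": guidance_scale,
    }


def generate(repo_id: str, prompt: str, device: str = "cuda"):
    """Load the pipeline from the Hub and return one PIL image."""
    # Imported lazily so the helper above works without the heavy dependencies.
    import torch
    from diffusers import StableDiffusionPipeline

    # Half precision on GPU to save memory; full precision elsewhere.
    dtype = torch.float16 if device == "cuda" else torch.float32
    # Assumes the repository contains a complete Stable Diffusion pipeline.
    pipe = StableDiffusionPipeline.from_pretrained(repo_id, torch_dtype=dtype)
    pipe = pipe.to(device)
    return pipe(**generation_kwargs(prompt)).images[0]
```

With the placeholder filled in, a call such as `generate(REPO_ID, "a photo of a cat").save("out.png")` would write the sampled image to disk.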