Edit model card
# DDPO trained model
num_epochs=10 

train_gradient_accumulation_steps=1 

sample_num_steps=30 

sample_batch_size=16 

train_batch_size=16 

sample_num_batches_per_epoch=32 

based off of stabilityai/stable-diffusion-2-base
and then trained off of None
Downloads last month
89