Make sure to set guidance scale to 1.0

from diffusers import AutoPipelineForText2Image
import torch

pipe = AutoPipelineForText2Image.from_pretrained("stabilityai/sdxl-turbo", torch_dtype=torch.float16, variant="fp16")
pipe.to("cuda")

prompt = "A cinematic shot of a baby racoon wearing an intricate italian priest robe."
steps = 1

image = pipe(prompt=prompt, num_inference_steps=steps, guidance_scale=0.0).images[0]
patrickvonplaten changed pull request status to open

The model produces 512x512 by default. Getting identical results now with demo:

1. Demo

f134668621b5d1032194c0b3ba321c835433494a2efa07a0c810e693.jpg

2. Diffusers with above code snippet:

GvntVWWlpj.png

does it work with img2img and controlnet pipelines?

Yes, Image-to-Image works (see notebook)

@patrickvonplaten Thanks, I just tested it with CPU, and it takes 10 seconds to generate an image on Core i7. I will integrate it with FastSD CPU.

patrickvonplaten changed pull request status to merged

Sign up or log in to comment