Comparison of Stable Diffusion XL (SDXL) 0.9 vs 1.0 For DreamBooth Training - Surprising Results
You can download SDXL 0.9 from here : https://huggingface.co/stabilityai/stable-diffusion-xl-base-0.9/tree/main
SDXL 0.9 was the first released beta version of Stable Diffusion XL.
I have used Kohya GUI SS and the config I shared here for training : https://www.patreon.com/posts/89213064
Video of how to use config : https://youtu.be/EEV8RPohsbw
For training: 15 training images (show below), 140 repeat, 1 epoch (so total 151402 = 4200 steps - takes less than 2 hours on RTX 3090 with 17 GB VRAM) and the real unsplash manually collected reg images from here : https://www.patreon.com/posts/massive-4k-woman-87700469 are used
Both for SDXL 0.9 and SDXL 1.0 exactly same training parameters and configuration used. For SDXL 0.9 I used the embedded VAE and for SDXL 1.0 I used the later released VAE which is supposed to be same as SDXL 0.9 VAE.
You can download original full resolution (6194 x 4034 pixels) and quality PNG images from attachments and see their PNG info (only PNG ones some failed so I uploaded as JPG) from Automatic1111 SD Web UI PNG info tab.
Prompt 1 PNG Info:
Medium shot photo of ohwx man wearing a very expensive suit in a studio with good lightning , hd, hdr, 2k, 4k, uhd
Negative prompt: cartoon, drawing, ugly, deformed, noisy, blurry, low contrast, realistic, 3d, cgi, render, anime, blender, graphic, drawing, digital art, sketch, line art, disfigured, mutated, abstract, 2d, minimalist, vintage, distorted, glitch, manga, Blurred, Hazy
Steps: 20, Sampler: DPM++ 2M SDE Karras, CFG scale: 7, Seed: 3103186800, Size: 1024x1024, Model hash: 1f6557fa7c, Model: 140_epoch_sdxl_0_9, ADetailer model: face_yolov8n.pt, ADetailer prompt: photo of ohwx man, ADetailer confidence: 0.3, ADetailer dilate erode: 4, ADetailer mask blur: 4, ADetailer denoising strength: 0.5, ADetailer inpaint only masked: True, ADetailer inpaint padding: 32, ADetailer use separate steps: True, ADetailer steps: 70, ADetailer version: 24.1.1, Script: X/Y/Z plot, X Type: Checkpoint name, X Values: "model\140_epoch_sdxl_0_9.safetensors [1f6557fa7c],model\140_epoch_sdxl_1_0.safetensors [cdaf2f236f]", Version: v1.7.0
Prompt 2 PNG Info:
closeshot photo of ohwx man wearing a suit in a surreal outworldly garden, sunlight, hd, hdr, 2k, 4k, uhd
Negative prompt: sunglasses, cartoon, drawing, ugly, deformed, noisy, blurry, low contrast, realistic, 3d, cgi, render, anime, blender, graphic, drawing, digital art, sketch, line art, disfigured, mutated, abstract, 2d, minimalist, vintage, distorted, glitch, manga, Blurred, Hazy
Steps: 20, Sampler: DPM++ 2M SDE Karras, CFG scale: 7, Seed: 3103186800, Size: 1024x1024, Model hash: 1f6557fa7c, Model: 140_epoch_sdxl_0_9, ADetailer model: face_yolov8n.pt, ADetailer prompt: photo of ohwx man, ADetailer confidence: 0.3, ADetailer dilate erode: 4, ADetailer mask blur: 4, ADetailer denoising strength: 0.5, ADetailer inpaint only masked: True, ADetailer inpaint padding: 32, ADetailer use separate steps: True, ADetailer steps: 70, ADetailer version: 24.1.1, Script: X/Y/Z plot, X Type: Checkpoint name, X Values: "model\140_epoch_sdxl_0_9.safetensors [1f6557fa7c],model\140_epoch_sdxl_1_0.safetensors [cdaf2f236f]", Version: v1.7.0
Prompt 3 PNG Info:
cinematic photo ohwx man riding dinosaur in a jungle with mud, sunny day shiny clear sky 35mm photograph,film,professional,4k,highly detailed
Negative prompt: sunglasses, cartoon, drawing, ugly, deformed, noisy, blurry, low contrast, realistic, 3d, cgi, render, anime, blender, graphic, drawing, digital art, sketch, line art, disfigured, mutated, abstract, 2d, minimalist, vintage, distorted, glitch, manga, Blurred, Hazy
Steps: 20, Sampler: DPM++ 2M SDE Karras, CFG scale: 7, Seed: 3103186800, Size: 1024x1024, Model hash: 1f6557fa7c, Model: 140_epoch_sdxl_0_9, ADetailer model: face_yolov8n.pt, ADetailer prompt: photo of ohwx man, ADetailer confidence: 0.3, ADetailer dilate erode: 4, ADetailer mask blur: 4, ADetailer denoising strength: 0.5, ADetailer inpaint only masked: True, ADetailer inpaint padding: 32, ADetailer use separate steps: True, ADetailer steps: 70, ADetailer version: 24.1.1, Script: X/Y/Z plot, X Type: Checkpoint name, X Values: "model\140_epoch_sdxl_0_9.safetensors [1f6557fa7c],model\140_epoch_sdxl_1_0.safetensors [cdaf2f236f]", Version: v1.7.0
Prompt 4 PNG Info:
picture of (ohwx man) wearing a suit near a lake, simple flat color, 2 dimensional, flat 2d art style, cartoon
Negative prompt: photo, photograph, ugly, deformed, noisy, blurry, low contrast, realistic, distant shot, close shot, medium shot, 3d, cgi, render, studio shot, studio, shot, camera
Steps: 20, Sampler: DPM++ 2M SDE Karras, CFG scale: 7, Seed: 3103186800, Size: 1024x1024, Model hash: 1f6557fa7c, Model: 140_epoch_sdxl_0_9, ADetailer model: face_yolov8n.pt, ADetailer prompt: "picture of (ohwx man), simple flat color, 2 dimensional, flat 2d art style", ADetailer confidence: 0.3, ADetailer dilate erode: 4, ADetailer mask blur: 4, ADetailer denoising strength: 0.5, ADetailer inpaint only masked: True, ADetailer inpaint padding: 32, ADetailer use separate steps: True, ADetailer steps: 70, ADetailer version: 24.1.1, Script: X/Y/Z plot, X Type: Checkpoint name, X Values: "model\140_epoch_sdxl_0_9.safetensors [1f6557fa7c],model\140_epoch_sdxl_1_0.safetensors [cdaf2f236f]", Version: v1.7.0
Prompt 5 PNG Info:
closeshot handsome photo of (ohwx man) (in a warrior armor ) in a coliseum, hdr, canon, hd, 8k, 4k, sharp focus
Negative prompt: cartoon, drawing, ugly, deformed, noisy, blurry, low contrast, realistic, 3d, cgi, render, anime, blender, graphic, drawing, digital art, sketch, line art, disfigured, mutated, abstract, 2d, minimalist, vintage, distorted, glitch, manga, Blurred, Hazy
Steps: 20, Sampler: DPM++ 2M SDE Karras, CFG scale: 7, Seed: 129509750, Size: 1024x1024, Model hash: 1f6557fa7c, Model: 140_epoch_sdxl_0_9, ADetailer model: face_yolov8n.pt, ADetailer prompt: photo of ohwx man, ADetailer confidence: 0.3, ADetailer mask only top k largest: 1, ADetailer dilate erode: 4, ADetailer mask blur: 4, ADetailer denoising strength: 0.5, ADetailer inpaint only masked: True, ADetailer inpaint padding: 32, ADetailer use separate steps: True, ADetailer steps: 70, ADetailer version: 24.1.1, Script: X/Y/Z plot, X Type: Checkpoint name, X Values: "model\140_epoch_sdxl_0_9.safetensors [1f6557fa7c],model\140_epoch_sdxl_1_0.safetensors [cdaf2f236f]", Version: v1.7.0
Prompt 6 PNG Info:
photo of warrior ohwx man with a pet dragon , epic, cinematic, sunlight, hd, hdr, 2k, 4k, uhd
Negative prompt: cartoon, drawing, ugly, deformed, noisy, blurry, low contrast, realistic, 3d, cgi, render, anime, blender, graphic, drawing, digital art, sketch, line art, disfigured, mutated, abstract, 2d, minimalist, vintage, distorted, glitch, manga, Blurred, Hazy
Steps: 20, Sampler: DPM++ 2M SDE Karras, CFG scale: 7, Seed: 2991427470, Size: 1024x1024, Model hash: 1f6557fa7c, Model: 140_epoch_sdxl_0_9, ADetailer model: face_yolov8n.pt, ADetailer prompt: photo of ohwx man, ADetailer confidence: 0.3, ADetailer mask only top k largest: 1, ADetailer dilate erode: 4, ADetailer mask blur: 4, ADetailer denoising strength: 0.5, ADetailer inpaint only masked: True, ADetailer inpaint padding: 32, ADetailer use separate steps: True, ADetailer steps: 70, ADetailer version: 24.1.1, Script: X/Y/Z plot, X Type: Checkpoint name, X Values: "model\140_epoch_sdxl_0_9.safetensors [1f6557fa7c],model\140_epoch_sdxl_1_0.safetensors [cdaf2f236f]", Version: v1.7.0
Prompt 7 PNG Info:
handsome portrait photo of (ohwx man) wearing a space armor on a space station, hdr, canon, hd, 8k, 4k, sharp focus
Negative prompt: cartoon, drawing, ugly, deformed, noisy, blurry, low contrast, realistic, 3d, cgi, render, anime, blender, graphic, drawing, digital art, sketch, line art, disfigured, mutated, abstract, 2d, minimalist, vintage, distorted, glitch, manga, Blurred, Hazy
Steps: 20, Sampler: DPM++ 2M SDE Karras, CFG scale: 7, Seed: 2897227315, Size: 1024x1024, Model hash: 1f6557fa7c, Model: 140_epoch_sdxl_0_9, ADetailer model: face_yolov8n.pt, ADetailer prompt: photo of ohwx man, ADetailer confidence: 0.3, ADetailer mask only top k largest: 1, ADetailer dilate erode: 4, ADetailer mask blur: 4, ADetailer denoising strength: 0.5, ADetailer inpaint only masked: True, ADetailer inpaint padding: 32, ADetailer use separate steps: True, ADetailer steps: 70, ADetailer version: 24.1.1, Script: X/Y/Z plot, X Type: Checkpoint name, X Values: "model\140_epoch_sdxl_0_9.safetensors [1f6557fa7c],model\140_epoch_sdxl_1_0.safetensors [cdaf2f236f]", Version: v1.7.0