
metadata

license: other
license_name: faipl-1.0-sd
license_link: https://freedevproject.org/faipl-1.0-sd/
language:
  - en
tags:
  - text-to-image
  - stable-diffusion
  - safetensors
  - stable-diffusion-xl

Raehoshi illust XL 3

Overview

Raehoshi illust XL 3 is an enhanced iteration of the Illustrious XL model. It refines the visual style by addressing some limitations of the base model, such as oversaturation and artifact noise. These issues are not entirely eliminated, but they are noticeably reduced. The goal is a more polished, balanced output that stays true to the strengths of the base model.

Model Details

Developed by: Raelina
Model type: Diffusion-based text-to-image generative model
Model prompt style: Booru-tags
License: Fair AI Public License 1.0-SD
Finetuned from: Illustrious XL v0.1

Recommended settings

Positive prompts:

masterpiece, best quality, very aesthetic,

Special tags: "dramatic lighting"

Negative prompts:

lowres, bad quality, worst quality, bad anatomy, sketch, jpeg artifacts, signature, watermark,
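As a minimal sketch, the recommended tags can be assembled into prompt strings with a small helper. The function names and the example subject tags are hypothetical, and placing the quality tags before the subject tags is an assumption based on the trailing comma in the recommendation.

```python
# Recommended tags copied from this model card.
QUALITY_TAGS = ["masterpiece", "best quality", "very aesthetic"]
NEGATIVE_TAGS = [
    "lowres", "bad quality", "worst quality", "bad anatomy",
    "sketch", "jpeg artifacts", "signature", "watermark",
]


def build_prompt(subject_tags, dramatic_lighting=False):
    """Join quality tags and Booru-style subject tags into one prompt.

    Assumption: quality tags go first; the card does not state an order.
    """
    tags = QUALITY_TAGS + list(subject_tags)
    if dramatic_lighting:
        # Special tag listed on the card; appended last here by choice.
        tags.append("dramatic lighting")
    return ", ".join(tags)


def build_negative_prompt(extra_tags=()):
    """Join the recommended negative tags with any extra ones."""
    return ", ".join(NEGATIVE_TAGS + list(extra_tags))


if __name__ == "__main__":
    # "1girl, solo, night sky" are illustrative tags, not from the card.
    print(build_prompt(["1girl", "solo", "night sky"], dramatic_lighting=True))
    print(build_negative_prompt())
```

Keeping the tags in lists rather than one long string makes it easier to add or drop individual tags when experimenting.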

CFG: 5-6
Sampling steps: 28
Sampler: Euler a
Supported resolutions:

1024 x 1024, 1152 x 896, 896 x 1152, 1216 x 832, 832 x 1216, 1344 x 768, 768 x 1344, 1536 x 640, 640 x 1536
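The settings above could be applied with the `diffusers` library roughly as follows. This is a sketch under assumptions, not an official usage snippet: it assumes the repository `Raelina/Raehoshi-illust-XL-3` can be loaded with `StableDiffusionXLPipeline.from_pretrained`, that a CUDA GPU is available, and that `EulerAncestralDiscreteScheduler` is an acceptable counterpart to the "Euler a" sampler. The CFG value 5.5 is simply the midpoint of the recommended 5-6 range, and the helper names are hypothetical.

```python
# Resolutions listed on this model card, as (width, height).
SUPPORTED_RESOLUTIONS = {
    (1024, 1024), (1152, 896), (896, 1152),
    (1216, 832), (832, 1216), (1344, 768),
    (768, 1344), (1536, 640), (640, 1536),
}

# CFG 5-6 and 28 sampling steps from the card; 5.5 is an assumed midpoint.
RECOMMENDED = {"guidance_scale": 5.5, "num_inference_steps": 28}


def check_resolution(width, height):
    """Raise ValueError if the size is not one of the listed resolutions."""
    if (width, height) not in SUPPORTED_RESOLUTIONS:
        raise ValueError(f"{width} x {height} is not a listed resolution")
    return width, height


def generate(prompt, negative_prompt, width=832, height=1216,
             repo_id="Raelina/Raehoshi-illust-XL-3"):
    """Generate one image; imports are deferred so the helpers above
    work without torch or diffusers installed."""
    import torch
    from diffusers import (
        EulerAncestralDiscreteScheduler,
        StableDiffusionXLPipeline,
    )

    check_resolution(width, height)
    # Assumption: the repo ships weights in a layout from_pretrained accepts.
    pipe = StableDiffusionXLPipeline.from_pretrained(
        repo_id, torch_dtype=torch.float16
    )
    # Swap in the Euler ancestral scheduler ("Euler a" in most UIs).
    pipe.scheduler = EulerAncestralDiscreteScheduler.from_config(
        pipe.scheduler.config
    )
    pipe.to("cuda")
    result = pipe(
        prompt=prompt,
        negative_prompt=negative_prompt,
        width=width,
        height=height,
        **RECOMMENDED,
    )
    return result.images[0]
```

Calling `generate(...)` with the recommended positive and negative prompts and then `.save("out.png")` on the returned image would write the result to disk.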

Hires.fix settings

Upscaler: 4x_NMKD-YandereNeoXL
Hires steps: 10-15
Denoising strength: 0.1-0.3
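Outside a web UI, a second pass in the spirit of Hires.fix can be approximated with an upscale followed by low-strength img2img. The sketch below makes several assumptions: the card gives no upscale factor, so `scale=1.5` is a placeholder; a plain Lanczos resize stands in for the 4x_NMKD-YandereNeoXL upscaler, which is not loaded here; and the repository is assumed to load with `StableDiffusionXLImg2ImgPipeline`. All function names are hypothetical.

```python
def hires_target_size(width, height, scale=1.5, multiple=8):
    """Scale a base size and snap each side to a multiple of 8 pixels."""
    def snap(value):
        return max(multiple, int(round(value * scale / multiple)) * multiple)
    return snap(width), snap(height)


def hires_pass(image, prompt, negative_prompt, scale=1.5, strength=0.2,
               steps=15, repo_id="Raelina/Raehoshi-illust-XL-3"):
    """Upscale a PIL image and refine it with low-strength img2img."""
    import torch
    from PIL import Image
    from diffusers import StableDiffusionXLImg2ImgPipeline

    target = hires_target_size(image.width, image.height, scale)
    # Stand-in for the recommended ESRGAN-style upscaler (assumption).
    upscaled = image.resize(target, Image.LANCZOS)

    pipe = StableDiffusionXLImg2ImgPipeline.from_pretrained(
        repo_id, torch_dtype=torch.float16
    ).to("cuda")
    # Note: diffusers img2img runs roughly steps * strength denoising
    # steps, so web-UI step counts may not map one-to-one; tune as needed.
    return pipe(
        prompt=prompt,
        negative_prompt=negative_prompt,
        image=upscaled,
        strength=strength,            # card recommends 0.1-0.3
        num_inference_steps=steps,
        guidance_scale=5.5,           # assumed midpoint of CFG 5-6
    ).images[0]
```

For example, `hires_target_size(832, 1216)` gives `(1248, 1824)` at the placeholder 1.5x factor.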

Training config

The model was developed using a two-stage fine-tuning process. Stage 1 introduced new series and characters into the model. Stage 2 focused on fixing issues and refining the overall style.

Stage 1

Dataset: 34k
Hardware: 2x H100 80GB
Batch size: 32
Gradient accumulation steps: 2
Learning rate: 5e-6
Text encoder learning rate: 2.5e-6
Epochs: 10

Stage 2

Dataset: 2.3k
Hardware: 1x A100 80GB
Batch size: 48
Gradient accumulation steps: 1
Learning rate: 3e-6
Text encoder training: disabled
Epochs: 10
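To get a feel for the scale of each stage, the effective batch size and approximate number of optimizer steps can be worked out from the figures above. The card does not say whether "Batch size" is per GPU or total, so this sketch computes both readings. It also treats "34k" and "2.3k" as roughly 34,000 and 2,300 training samples, which is an assumption, and the class and function names are hypothetical.

```python
import math
from dataclasses import dataclass


@dataclass
class Stage:
    name: str
    dataset_size: int   # approximate sample count (assumed from "34k"/"2.3k")
    gpus: int
    batch_size: int
    grad_accum: int
    epochs: int


def effective_batch(stage, per_device):
    """Samples per optimizer step under either batch-size reading."""
    devices = stage.gpus if per_device else 1
    return stage.batch_size * stage.grad_accum * devices


def optimizer_steps(stage, per_device):
    """Approximate total optimizer steps across all epochs."""
    per_epoch = math.ceil(stage.dataset_size / effective_batch(stage, per_device))
    return per_epoch * stage.epochs


STAGE_1 = Stage("Stage 1", 34_000, gpus=2, batch_size=32, grad_accum=2, epochs=10)
STAGE_2 = Stage("Stage 2", 2_300, gpus=1, batch_size=48, grad_accum=1, epochs=10)

if __name__ == "__main__":
    for stage in (STAGE_1, STAGE_2):
        for per_device in (False, True):
            reading = "per-GPU" if per_device else "total"
            print(
                f"{stage.name} ({reading} batch): "
                f"effective batch {effective_batch(stage, per_device)}, "
                f"about {optimizer_steps(stage, per_device)} steps"
            )
```

Under the "total" reading, Stage 1 has an effective batch of 64; under the "per-GPU" reading it is 128. Stage 2 uses a single GPU, so both readings give 48.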

License

Fair AI Public License 1.0-SD