File size: 5,922 Bytes
514db2a
 
d926073
 
 
 
 
 
 
 
 
 
cba2c16
 
 
 
 
 
d926073
4581d83
 
 
 
926552c
4581d83
 
00e069b
 
b7544f4
609d705
377440c
609d705
0e414e7
b060779
 
0e414e7
2e25e36
0e414e7
 
 
609d705
 
2e25e36
 
 
 
609d705
377440c
609d705
cca89ef
5bc6274
cca89ef
609d705
dcb0179
609d705
dcb0179
 
 
b7544f4
5bc6274
609d705
 
377440c
609d705
4581d83
 
 
f091ad0
4581d83
 
e932857
8ca8c6c
 
 
4327b81
4581d83
 
6bcf28d
4327b81
a6da21e
538d923
 
 
 
 
 
 
 
 
 
a6da21e
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
57d3a6d
e932857
a6da21e
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
---
license: mit
datasets:
- hearmeneigh/e621-rising-v3-curated
- hearmeneigh/e621-rising-v3-finetuner
library_name: diffusers
pipeline_tag: text-to-image
tags:
- anthro
- furry
- e621
- nsfw
- booru
- imagebooru
- imageboard
- gelbooru
- danbooru
- rule34
- not-for-all-audiences
---

<div style='background: #ffeef1; border: 1px solid #fd91a4; padding:1em; border-radius:3px; margin-bottom:2em;'>
<h3 style='margin:0'>NSFW</h3>
<p style='margin:0'>This model  is not suitable for use by minors. The model can and will produce X-rated/NFSW content.</p>
</div>


<div style='background: #eefaff; border: 1px solid #91cefd; padding:1em; border-radius:3px; margin-bottom:2em;'>
<h1 style='margin:0'>Quickstart</h1>
<div style='margin:0; margin-top: 1em; margin-left:1em;'>
  <h3 style='margin:0'>Downloads</h3>
    <div style="margin-left: 1em;">
      <a href="https://huggingface.co/hearmeneigh/e621-rising-v3-safetensors/resolve/main/e621-rising-v3-epoch-34.fp16.safetensors" style="text-decoration: none !important;">

      </a>
      <a href="https://huggingface.co/hearmeneigh/e621-rising-v3-safetensors/resolve/main/e621-rising-v3-epoch-34.fp16.safetensors" style="text-decoration: underline">Checkpoint</a>
      <small style='padding-left: 0.5em'>
        (<a href="https://huggingface.co/hearmeneigh/e621-rising-v3-safetensors/resolve/main/e621-rising-v3-epoch-34.safetensors">fp32</a> | 
        <a href="https://huggingface.co/hearmeneigh/e621-rising-v3-safetensors/resolve/main/e621-rising-v3-epoch-34.fp16.safetensors">fp16</a> | 
        <a href="https://huggingface.co/hearmeneigh/e621-rising-v3-safetensors/resolve/main/e621-rising-v3-epoch-34.bf16.safetensors">bf16</a>)
      </small>
      <br />
      <a href="https://huggingface.co/datasets/hearmeneigh/e621-rising-v3-preliminary-data/resolve/main/rising-v3.csv" style="text-decoration: none !important">

      </a>
      <a href="https://huggingface.co/datasets/hearmeneigh/e621-rising-v3-preliminary-data/resolve/main/rising-v3.csv">Tag Autocomplete CSV</a>
    </div>
  <h3 style='margin:0; margin-top: 1em;'>Reference</h3>
    <div style='margin-left: 1em;'>
      • <a href="https://huggingface.co/hearmeneigh/e621-rising-v3/blob/main/INSTALL.md">Installation instructions</a>
      <br />
      • <a href="https://huggingface.co/hearmeneigh/e621-rising-v3/blob/main/CHANGELOG.md">What's new in v3?</a>
      <br />
      • <a href="https://huggingface.co/hearmeneigh/e621-rising-v3/blob/main/PROMPT-EXAMPLES.md" style='display: inline-block; margin-top:0.5em'>Prompt examples</a>
      <br />
      • <a href="https://huggingface.co/hearmeneigh/e621-rising-v3/blob/main/PROMPT-GUIDE.md">Prompt guide</a>
      <br />
      • <a href="https://huggingface.co/datasets/hearmeneigh/e621-rising-v3-preliminary-data/blob/main/tag-counts.by-name.json" style='display: inline-block; margin-top:0.5em'>Tag list</a>
      <br />
      • <a href="https://huggingface.co/hearmeneigh/e621-rising-v3/blob/main/AUTOCOMPLETE.md">Tag autocomplete guide</a>
    </div>
</div>
</div>

# E621 Rising V3 (SDXL)
* Furry / anthro base model trained with images (mainly) from [E621](https://e621.net) 
* Guaranteed **NSFW** or your money back
* Stable Diffusion XL 1.0 model:
  * `1024x1024px`
  * Trained with 11 epochs of 280,000 images each
  * Finetuned with 23 epochs of 40,000 images each
* Compatible with [🤗 `diffusers`](https://huggingface.co/docs/diffusers/index)
* Compatible with [`stable-diffusion-webui`](https://github.com/AUTOMATIC1111/stable-diffusion-webui)
* Fully open source crawl, dataset, curation, and training process:
  * Use these tools to train your own version with your own dataset!
  * [Configuration](https://github.com/hearmeneigh/e621-rising-configs)
  * [Toolchain](https://github.com/hearmeneigh/dataset-rising)
  * [Dataset](https://huggingface.co/datasets/hearmeneigh/e621-rising-v3-curated)


## Examples
For more examples, [continue here](PROMPT-EXAMPLES.md).

<img src='https://huggingface.co/datasets/hearmeneigh/e621-rising-v3-preliminary-data/resolve/main/examples/singles/03-birb.jpg' alt='Birb' style='max-width:512px;' />
<img src='https://huggingface.co/datasets/hearmeneigh/e621-rising-v3-preliminary-data/resolve/main/examples/singles/02-royal-stallion.jpg' alt='Royal Stallion' style='max-width:512px;' />
<img src='https://huggingface.co/datasets/hearmeneigh/e621-rising-v3-preliminary-data/resolve/main/examples/singles/06-assassin.jpg' alt='Assassin' style='max-width:512px;' />
<img src='https://huggingface.co/datasets/hearmeneigh/e621-rising-v3-preliminary-data/resolve/main/examples/singles/05-neekedneko.jpg' alt='Neeked Neko' style='max-width:512px;' />

For more examples, [continue here](PROMPT-EXAMPLES.md).

## Training Procedure

[Training legend](TBD)

* 160 images per batch (epoch variant)
* `1024x1024px` image size
* Adam optimizer
  * Beta1 = `0.9`
  * Beta2 = `0.999`
  * Weight decay = `1e-2`
  * Epsilon = `1e-08`
* Constant learning rate `4e-6`
* `fp16` mixed precision
* SNR gamma set to `5.0`
* Noise offset set to `0.07`
* `cosine_with_restarts` scheduler
* 11 epochs of [V3 curated dataset](https://huggingface.co/datasets/hearmeneigh/e621-rising-v3-curated) samples resized to `< 1024x1024px` (maintain aspect ratio)
* 16 epochs of [V3 finetuner dataset](https://huggingface.co/datasets/hearmeneigh/e621-rising-v3-finetuner) samples resized to `< 1024x1024px` (maintain aspect ratio)
* 6 epochs of [V3 finetuner dataset](https://huggingface.co/datasets/hearmeneigh/e621-rising-v3-finetuner) samples resized to `< 1024x1024px` (maintain aspect ratio, randomly drop 70% of tags)
* 1 epoch of [V3 finetuner dataset](https://huggingface.co/datasets/hearmeneigh/e621-rising-v3-finetuner) samples resized to `< 1024x1024px` (maintain aspect ratio, randomly drop 50% of tags) and learning rate set to `4e-5`
* Tags for each sample are shuffled for each epoch