for testing

#7
by Gerom - opened
.gitattributes CHANGED
@@ -33,5 +33,3 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  vae/kl-f8-anime.ckpt filter=lfs diff=lfs merge=lfs -text
34
  vae/kl-f8-anime2.ckpt filter=lfs diff=lfs merge=lfs -text
35
  models/wd-1-3-penultimate-ucg-cont.ckpt filter=lfs diff=lfs merge=lfs -text
36
- models/wd-1-4-float32-booru-110k.ckpt filter=lfs diff=lfs merge=lfs -text
37
- wd-1-4-anime_e1.ckpt filter=lfs diff=lfs merge=lfs -text
 
33
  vae/kl-f8-anime.ckpt filter=lfs diff=lfs merge=lfs -text
34
  vae/kl-f8-anime2.ckpt filter=lfs diff=lfs merge=lfs -text
35
  models/wd-1-3-penultimate-ucg-cont.ckpt filter=lfs diff=lfs merge=lfs -text
 
 
README.md CHANGED
@@ -1,47 +1,11 @@
1
  ---
2
- language:
3
- - en
4
- tags:
5
- - stable-diffusion
6
- - text-to-image
7
  license: creativeml-openrail-m
8
- inference: false
9
-
10
  ---
11
 
12
- ![image](https://user-images.githubusercontent.com/26317155/210155933-db3a5f1a-1ec3-4777-915c-6deff2841ce9.png)
13
-
14
- <sub>masterpiece, best quality, 1girl, green hair, sweater, looking at viewer, upper body, beanie, outdoors, watercolor, night, turtleneck</sub>
15
-
16
- # Waifu Diffusion v1.4
17
-
18
- Waifu Diffusion is a latent text-to-image diffusion model that has been conditioned on high-quality anime images through fine-tuning.
19
-
20
- - [Waifu Diffusion 1.4 Anime Epoch 1](https://huggingface.co/hakurei/waifu-diffusion-v1-4/blob/main/wd-1-4-anime_e1.ckpt): A test model made to properly ensure that the training setup works.
21
- - [Waifu Diffusion 1.4 Anime Inference Config](https://huggingface.co/hakurei/waifu-diffusion-v1-4/blob/main/wd-1-4-anime_e1.yaml): A file included to allow for inference with Automatic's WebUI and with the original Stable Diffusion codebase.
22
-
23
- ## License
24
-
25
- This model is open access and available to all, with a CreativeML OpenRAIL-M license further specifying rights and usage.
26
- The CreativeML OpenRAIL License specifies:
27
-
28
- 1. You can't use the model to deliberately produce nor share illegal or harmful outputs or content
29
- 2. The authors claims no rights on the outputs you generate, you are free to use them and are accountable for their use which must not go against the provisions set in the license
30
- 3. You may re-distribute the weights and use the model commercially and/or as a service. If you do, please be aware you have to include the same use restrictions as the ones in the license and share a copy of the CreativeML OpenRAIL-M to all your users (please read the license entirely and carefully)
31
- [Please read the full license here](https://huggingface.co/spaces/CompVis/stable-diffusion-license)
32
-
33
- ## Downstream Uses
34
-
35
- This model can be used for entertainment purposes and as a generative art assistant.
36
-
37
- ## Team Members and Acknowledgements
38
-
39
- This project would not have been possible without the incredible work by Stability AI and NovelAI.
40
-
41
- - [Haru](https://github.com/harubaru)
42
- - [Salt](https://github.com/sALTaccount/)
43
- - [Cafe](https://twitter.com/cafeai_labs)
44
-
45
- In order to reach us, you can join our [Discord server](https://discord.gg/touhouai).
46
 
47
- [![Discord Server](https://discordapp.com/api/guilds/930499730843250783/widget.png?style=banner2)](https://discord.gg/touhouai)
 
 
 
 
 
1
  ---
 
 
 
 
 
2
  license: creativeml-openrail-m
 
 
3
  ---
4
 
5
+ Future repo for the Waifu Diffusion 1.4 models.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
6
 
7
+ This repo will contain experimental models such as:
8
+ - An AI image detector
9
+ - A Img2Text Diffusion Prior
10
+ - Finetuned VAEs
11
+ - The actual latent diffusion model
models/wd-1-3-5_80000-fp32.ckpt DELETED
@@ -1,3 +0,0 @@
1
- version https://git-lfs.github.com/spec/v1
2
- oid sha256:b438efac4434af4e482d20cdfcea64067f8dfec438628261d2f2aa60ffc41452
3
- size 4098020782
 
 
 
 
wd-1-4-anime_e1.ckpt DELETED
@@ -1,3 +0,0 @@
1
- version https://git-lfs.github.com/spec/v1
2
- oid sha256:e8f4b2225b8ac6464e9d13d25b708349b1ca340c92dfc3575e8d5ef18f689457
3
- size 5160236290
 
 
 
 
wd-1-4-anime_e1.yaml DELETED
@@ -1,67 +0,0 @@
1
- model:
2
- base_learning_rate: 1.0e-4
3
- target: ldm.models.diffusion.ddpm.LatentDiffusion
4
- params:
5
- linear_start: 0.00085
6
- linear_end: 0.0120
7
- num_timesteps_cond: 1
8
- log_every_t: 200
9
- timesteps: 1000
10
- first_stage_key: "jpg"
11
- cond_stage_key: "txt"
12
- image_size: 64
13
- channels: 4
14
- cond_stage_trainable: false
15
- conditioning_key: crossattn
16
- monitor: val/loss_simple_ema
17
- scale_factor: 0.18215
18
- use_ema: False # we set this to false because this is an inference only config
19
-
20
- unet_config:
21
- target: ldm.modules.diffusionmodules.openaimodel.UNetModel
22
- params:
23
- use_checkpoint: True
24
- use_fp16: True
25
- image_size: 32 # unused
26
- in_channels: 4
27
- out_channels: 4
28
- model_channels: 320
29
- attention_resolutions: [ 4, 2, 1 ]
30
- num_res_blocks: 2
31
- channel_mult: [ 1, 2, 4, 4 ]
32
- num_head_channels: 64 # need to fix for flash-attn
33
- use_spatial_transformer: True
34
- use_linear_in_transformer: True
35
- transformer_depth: 1
36
- context_dim: 1024
37
- legacy: False
38
-
39
- first_stage_config:
40
- target: ldm.models.autoencoder.AutoencoderKL
41
- params:
42
- embed_dim: 4
43
- monitor: val/rec_loss
44
- ddconfig:
45
- #attn_type: "vanilla-xformers"
46
- double_z: true
47
- z_channels: 4
48
- resolution: 256
49
- in_channels: 3
50
- out_ch: 3
51
- ch: 128
52
- ch_mult:
53
- - 1
54
- - 2
55
- - 4
56
- - 4
57
- num_res_blocks: 2
58
- attn_resolutions: []
59
- dropout: 0.0
60
- lossconfig:
61
- target: torch.nn.Identity
62
-
63
- cond_stage_config:
64
- target: ldm.modules.encoders.modules.FrozenOpenCLIPEmbedder
65
- params:
66
- freeze: True
67
- layer: "penultimate"
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
wd-1-4-anime_e2.ckpt DELETED
@@ -1,3 +0,0 @@
1
- version https://git-lfs.github.com/spec/v1
2
- oid sha256:c76e0962bc60ccdc18e0dce387635b472b5a19038d637216030acbbe6eda2713
3
- size 5160236290