README.md CHANGED
@@ -2,46 +2,88 @@
2
  language:
3
  - en
4
  license: creativeml-openrail-m
5
- thumbnail: "https://huggingface.co/Guizmus/SD_DreamerCommunities_Collection/resolve/main/showcase_AnimeChan.jpg"
6
  tags:
7
  - stable-diffusion
8
  - text-to-image
9
  - image-to-image
10
  library_name: "EveryDream"
11
- datasets:
12
- - Guizmus/AnimeChanStyle
13
  inference: false
14
  ---
15
 
16
  # AnimeChan Style
17
- <p>
18
- <img alt="Showcase" src="https://huggingface.co/Guizmus/SD_DreamerCommunities_Collection/resolve/main/showcase_AnimeChan.jpg"/><br/>
19
- This model was based on <a href="https://huggingface.co/naclbit/trinart_stable_diffusion_v2">Trinart</a> model.<br/>
20
- The dataset was a collaborative effort of the Stable Diffusion #anime channel, made of pictures from the users themselves using their different techniques.<br/>
21
- 100 total pictures in the dataset, 300 repeats total each, over 6 Epoch on LR1e-6.<br/>
22
- This was trained using EveryDream with a full caption of all training pictures. The dataset can be found <a href="https://huggingface.co/datasets/Guizmus/AnimeChanStyle">here</a>.<br/>
23
- <br/>
24
- The style will be called by the use of the token <b>AnimeChan Style</b>.<br/>
25
- <br/>
26
- To access this model, you can download the CKPT file below, or use the <a href="https://huggingface.co/Guizmus/AnimeChanStyle/tree/main">diffusers</a>
27
- </p>
28
 
29
- [current v2 download link](https://huggingface.co/Guizmus/SD_DreamerCommunities_Collection/resolve/main/AnimeChanStyle-v2.ckpt)
 
30
 
31
- [dataset for the second version](https://huggingface.co/datasets/Guizmus/AnimeChanStyle)
32
 
33
- ## First version
34
 
35
- ![first version showcase](https://huggingface.co/Guizmus/SD_DreamerCommunities_Collection/resolve/main/showcase.jpg)
36
 
37
- [first version download link](https://huggingface.co/Guizmus/SD_DreamerCommunities_Collection/resolve/main/AnimeChanStyle_v1.ckpt)
38
 
39
- [dataset for the first version](https://huggingface.co/Guizmus/SD_DreamerCommunities_Collection/resolve/main/AnimeChan%20Style.zip)
40
 
41
 
42
- ## License
43
 
44
- This model is open access and available to all, with a CreativeML OpenRAIL-M license further specifying rights and usage.
45
  The CreativeML OpenRAIL License specifies:
46
 
47
  1. You can't use the model to deliberately produce nor share illegal or harmful outputs or content
 
2
  language:
3
  - en
4
  license: creativeml-openrail-m
5
+ thumbnail: "https://huggingface.co/Guizmus/SD_DreamerCommunities_Collection/resolve/main/images/showcase_main.jpg"
6
  tags:
7
  - stable-diffusion
8
  - text-to-image
9
  - image-to-image
10
  library_name: "EveryDream"
11
  inference: false
12
  ---
13
 
14
+ # Introduction
15
+
16
+ This is a collection of models made from, and for, the users of the Stable Diffusion Discord server. Different categories of channels exist there; the "Dreamers Communities" cover a range of subjects such as Anime, 3D, or Architecture. Each of these channels has users posting images made with Stable Diffusion using their own techniques. After asking the users, and depending on the activity of each channel, I collect a dataset either from new submissions or from the channel's history, and intend to build a model representing the style of each community, so that users can more easily produce images in the style they like and mix it with other concepts.
17
+
18
+ These models are mainly trained with EveryDream, and the compatible datasets should eventually be merged into a single Mega Model. Some models, like the Anime one, need to stay on a different starting point and may not get merged.
19
+
20
+
21
+
22
+ # 3DChan Style
23
+
24
+
25
+ ## Dataset & training
26
+
27
+ This model was based on the [RunwayML SD 1.5](https://huggingface.co/runwayml/stable-diffusion-v1-5) model with the updated VAE.
28
+
29
+ The dataset was a collaborative effort of the Stable Diffusion #3D channel, made of pictures contributed by the users themselves and created with their different techniques.
30
+
31
+ The dataset contains 120 pictures in total, each repeated 500 times overall, trained over 10 epochs at LR 1e-6.
32
+
33
+ Training was done with EveryDream, using a full caption for every training picture.
34
+
35
+ The style is invoked by including the token **3D Style** in the prompt.
36
+
37
+ Other significant tokens: rick roll, fullbody shot, bad cosplay man.
38
+
39
+ ## Showcase & Downloads
40
+
41
+ ![Showcase](https://huggingface.co/Guizmus/SD_DreamerCommunities_Collection/resolve/main/images/showcase_3DChanStyle-v1.jpg)
42
+
43
+ [CKPT (2GB)](https://huggingface.co/Guizmus/SD_DreamerCommunities_Collection/resolve/main/ckpt/3DStyle-v1.ckpt)
44
+
45
+ [CKPT with training optimizers (11GB)](https://huggingface.co/Guizmus/SD_DreamerCommunities_Collection/resolve/main/ckpt/3DStyle-v1_with_optimizers.ckpt)
46
+
47
+ [Dataset](https://huggingface.co/Guizmus/SD_DreamerCommunities_Collection/resolve/main/datasets/3DChanStyle-v1.zip)
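+
+ Below is a minimal usage sketch, not part of the card above: it assumes a recent `diffusers` release that provides `StableDiffusionPipeline.from_single_file`, and that the 2GB CKPT linked above has been downloaded locally; the file path and prompt are only examples.
+
+ ```python
+ # Hypothetical example: load the downloaded 3DStyle-v1.ckpt and generate an image
+ # with the trained style token "3D Style" (plus one of the other significant tokens).
+ import torch
+ from diffusers import StableDiffusionPipeline
+
+ pipe = StableDiffusionPipeline.from_single_file(
+     "./3DStyle-v1.ckpt",        # CKPT (2GB) from the download link above
+     torch_dtype=torch.float16,
+ )
+ pipe = pipe.to("cuda")
+
+ prompt = "a knight in shining armor, fullbody shot, 3D Style"
+ image = pipe(prompt, num_inference_steps=30, guidance_scale=7.5).images[0]
+ image.save("3DStyle_sample.png")
+ ```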
48
+
49
+
50
+
51
  # AnimeChan Style
 
52
 
53
+ ## Dataset & training
54
+
55
+ This model was based on the [Trinart](https://huggingface.co/naclbit/trinart_stable_diffusion_v2) model.
56
+
57
+ The dataset was a collaborative effort of the Stable Diffusion #anime channel, made of pictures contributed by the users themselves and created with their different techniques.
58
+
59
+ The dataset contains 100 pictures in total, each repeated 300 times overall, trained over 6 epochs at LR 1e-6.
60
+
61
+ Training was done with EveryDream, using a full caption for every training picture.
62
+
63
+ The style is invoked by including the token **AnimeChan Style** in the prompt.
64
+
65
+ ## Downloads v2
66
+
67
+ ![Showcase](https://huggingface.co/Guizmus/SD_DreamerCommunities_Collection/resolve/main/images/showcase_AnimeChan-v2.jpg)
68
+
69
+ [CKPT (2GB)](https://huggingface.co/Guizmus/SD_DreamerCommunities_Collection/resolve/main/ckpt/AnimeChanStyle-v2.ckpt)
70
+
71
+ [Dataset](https://huggingface.co/Guizmus/SD_DreamerCommunities_Collection/resolve/main/datasets/AnimeChanStyle-v2.zip)
72
+
73
+ ## Downloads v1
74
 
75
+ ![Showcase](https://huggingface.co/Guizmus/SD_DreamerCommunities_Collection/resolve/main/images/showcase_AnimeChan-v1.jpg)
76
 
77
+ [CKPT (2GB)](https://huggingface.co/Guizmus/SD_DreamerCommunities_Collection/resolve/main/ckpt/AnimeChanStyle-v1.ckpt)
78
 
79
+ [Dataset](https://huggingface.co/Guizmus/SD_DreamerCommunities_Collection/resolve/main/datasets/AnimeChanStyle-v1.zip)
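+
+ Since the card is also tagged image-to-image, here is a minimal img2img sketch, an illustration rather than a documented workflow. It assumes a recent `diffusers` release and that the diffusers-format weights remain available in the [Guizmus/AnimeChanStyle](https://huggingface.co/Guizmus/AnimeChanStyle) repository linked in earlier versions of this card; the input image and prompt are placeholders.
+
+ ```python
+ # Hypothetical example: restyle an existing picture with the "AnimeChan Style" token.
+ import torch
+ from diffusers import StableDiffusionImg2ImgPipeline
+ from diffusers.utils import load_image
+
+ pipe = StableDiffusionImg2ImgPipeline.from_pretrained(
+     "Guizmus/AnimeChanStyle",   # diffusers weights, as linked in the v1 card
+     torch_dtype=torch.float16,
+ )
+ pipe = pipe.to("cuda")
+
+ init_image = load_image("my_sketch.png").resize((512, 512))  # any starting picture
+ prompt = "portrait of a smiling girl, AnimeChan Style"
+ image = pipe(prompt=prompt, image=init_image, strength=0.7, guidance_scale=7.5).images[0]
+ image.save("AnimeChanStyle_sample.png")
+ ```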
80
 
 
81
 
 
82
 
83
 
84
+ # License
85
 
86
+ These models are open access and available to all, with a CreativeML OpenRAIL-M license further specifying rights and usage.
87
  The CreativeML OpenRAIL License specifies:
88
 
89
  1. You can't use the model to deliberately produce nor share illegal or harmful outputs or content
unet/diffusion_pytorch_model.bin → ckpt/3DStyle-v1.ckpt RENAMED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:a8ab275da85b3ff348e1e4a60cda864dc14acbb0676dd3f813160913c29bd740
3
- size 3438366373
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9698c46f614733b95d75cae29e777be15b2f13758f92d8eb113d566d79b665a6
3
+ size 2132888989
safety_checker/pytorch_model.bin → ckpt/3DStyle-v1_with_optimizers.ckpt RENAMED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:16d28f2b37109f222cdc33620fdd262102ac32112be0352a7f77e9614b35a394
3
- size 1216064769
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:8bca5d3ef403ad1ecfa7d3c9c5717a3bd5fa7899f93cd5526e4d55bbbd380147
3
+ size 12126930715
AnimeChanStyle_v1.ckpt → ckpt/AnimeChanStyle-v1.ckpt RENAMED
File without changes
AnimeChanStyle-v2.ckpt → ckpt/AnimeChanStyle-v2.ckpt RENAMED
File without changes
text_encoder/pytorch_model.bin → datasets/3DChanStyle-v1.zip RENAMED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:a2d9696908f0799f233577504bd321350000c0ccf236df5f9ebb7561a24af46e
3
- size 492307041
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0b659800b6b5140f8e953be05d7acd6d02296dbcbbd43dae944af84b2c5d131a
3
+ size 29246736
AnimeChan Style.zip → datasets/AnimeChanStyle-v1.zip RENAMED
File without changes
vae/diffusion_pytorch_model.bin → datasets/AnimeChanStyle-v2.zip RENAMED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:6723bacd3c60b11a2b4e6007338a54c6964c210116c3ccecb3bfc80e218afc8f
3
- size 334711857
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:09f5678e615988ce35d01ce9980a546568e27ec52065d2578c265275b070ea27
3
+ size 98956573
feature_extractor/preprocessor_config.json DELETED
@@ -1,20 +0,0 @@
1
- {
2
- "crop_size": 224,
3
- "do_center_crop": true,
4
- "do_convert_rgb": true,
5
- "do_normalize": true,
6
- "do_resize": true,
7
- "feature_extractor_type": "CLIPFeatureExtractor",
8
- "image_mean": [
9
- 0.48145466,
10
- 0.4578275,
11
- 0.40821073
12
- ],
13
- "image_std": [
14
- 0.26862954,
15
- 0.26130258,
16
- 0.27577711
17
- ],
18
- "resample": 3,
19
- "size": 224
20
- }
 
images/showcase_3DChanStyle-v1.jpg ADDED
showcase.jpg → images/showcase_AnimeChan-v1.jpg RENAMED
File without changes
showcase_AnimeChan.jpg → images/showcase_AnimeChan-v2.jpg RENAMED
File without changes
images/showcase_main.jpg ADDED
model_index.json DELETED
@@ -1,32 +0,0 @@
1
- {
2
- "_class_name": "StableDiffusionPipeline",
3
- "_diffusers_version": "0.8.0.dev0",
4
- "feature_extractor": [
5
- "transformers",
6
- "CLIPFeatureExtractor"
7
- ],
8
- "safety_checker": [
9
- "stable_diffusion",
10
- "StableDiffusionSafetyChecker"
11
- ],
12
- "scheduler": [
13
- "diffusers",
14
- "PNDMScheduler"
15
- ],
16
- "text_encoder": [
17
- "transformers",
18
- "CLIPTextModel"
19
- ],
20
- "tokenizer": [
21
- "transformers",
22
- "CLIPTokenizer"
23
- ],
24
- "unet": [
25
- "diffusers",
26
- "UNet2DConditionModel"
27
- ],
28
- "vae": [
29
- "diffusers",
30
- "AutoencoderKL"
31
- ]
32
- }
 
safety_checker/config.json DELETED
@@ -1,179 +0,0 @@
1
- {
2
- "_commit_hash": "4bb648a606ef040e7685bde262611766a5fdd67b",
3
- "_name_or_path": "CompVis/stable-diffusion-safety-checker",
4
- "architectures": [
5
- "StableDiffusionSafetyChecker"
6
- ],
7
- "initializer_factor": 1.0,
8
- "logit_scale_init_value": 2.6592,
9
- "model_type": "clip",
10
- "projection_dim": 768,
11
- "text_config": {
12
- "_name_or_path": "",
13
- "add_cross_attention": false,
14
- "architectures": null,
15
- "attention_dropout": 0.0,
16
- "bad_words_ids": null,
17
- "begin_suppress_tokens": null,
18
- "bos_token_id": 0,
19
- "chunk_size_feed_forward": 0,
20
- "cross_attention_hidden_size": null,
21
- "decoder_start_token_id": null,
22
- "diversity_penalty": 0.0,
23
- "do_sample": false,
24
- "dropout": 0.0,
25
- "early_stopping": false,
26
- "encoder_no_repeat_ngram_size": 0,
27
- "eos_token_id": 2,
28
- "exponential_decay_length_penalty": null,
29
- "finetuning_task": null,
30
- "forced_bos_token_id": null,
31
- "forced_eos_token_id": null,
32
- "hidden_act": "quick_gelu",
33
- "hidden_size": 768,
34
- "id2label": {
35
- "0": "LABEL_0",
36
- "1": "LABEL_1"
37
- },
38
- "initializer_factor": 1.0,
39
- "initializer_range": 0.02,
40
- "intermediate_size": 3072,
41
- "is_decoder": false,
42
- "is_encoder_decoder": false,
43
- "label2id": {
44
- "LABEL_0": 0,
45
- "LABEL_1": 1
46
- },
47
- "layer_norm_eps": 1e-05,
48
- "length_penalty": 1.0,
49
- "max_length": 20,
50
- "max_position_embeddings": 77,
51
- "min_length": 0,
52
- "model_type": "clip_text_model",
53
- "no_repeat_ngram_size": 0,
54
- "num_attention_heads": 12,
55
- "num_beam_groups": 1,
56
- "num_beams": 1,
57
- "num_hidden_layers": 12,
58
- "num_return_sequences": 1,
59
- "output_attentions": false,
60
- "output_hidden_states": false,
61
- "output_scores": false,
62
- "pad_token_id": 1,
63
- "prefix": null,
64
- "problem_type": null,
65
- "pruned_heads": {},
66
- "remove_invalid_values": false,
67
- "repetition_penalty": 1.0,
68
- "return_dict": true,
69
- "return_dict_in_generate": false,
70
- "sep_token_id": null,
71
- "suppress_tokens": null,
72
- "task_specific_params": null,
73
- "temperature": 1.0,
74
- "tf_legacy_loss": false,
75
- "tie_encoder_decoder": false,
76
- "tie_word_embeddings": true,
77
- "tokenizer_class": null,
78
- "top_k": 50,
79
- "top_p": 1.0,
80
- "torch_dtype": null,
81
- "torchscript": false,
82
- "transformers_version": "4.24.0",
83
- "typical_p": 1.0,
84
- "use_bfloat16": false,
85
- "vocab_size": 49408
86
- },
87
- "text_config_dict": {
88
- "hidden_size": 768,
89
- "intermediate_size": 3072,
90
- "num_attention_heads": 12,
91
- "num_hidden_layers": 12
92
- },
93
- "torch_dtype": "float32",
94
- "transformers_version": null,
95
- "vision_config": {
96
- "_name_or_path": "",
97
- "add_cross_attention": false,
98
- "architectures": null,
99
- "attention_dropout": 0.0,
100
- "bad_words_ids": null,
101
- "begin_suppress_tokens": null,
102
- "bos_token_id": null,
103
- "chunk_size_feed_forward": 0,
104
- "cross_attention_hidden_size": null,
105
- "decoder_start_token_id": null,
106
- "diversity_penalty": 0.0,
107
- "do_sample": false,
108
- "dropout": 0.0,
109
- "early_stopping": false,
110
- "encoder_no_repeat_ngram_size": 0,
111
- "eos_token_id": null,
112
- "exponential_decay_length_penalty": null,
113
- "finetuning_task": null,
114
- "forced_bos_token_id": null,
115
- "forced_eos_token_id": null,
116
- "hidden_act": "quick_gelu",
117
- "hidden_size": 1024,
118
- "id2label": {
119
- "0": "LABEL_0",
120
- "1": "LABEL_1"
121
- },
122
- "image_size": 224,
123
- "initializer_factor": 1.0,
124
- "initializer_range": 0.02,
125
- "intermediate_size": 4096,
126
- "is_decoder": false,
127
- "is_encoder_decoder": false,
128
- "label2id": {
129
- "LABEL_0": 0,
130
- "LABEL_1": 1
131
- },
132
- "layer_norm_eps": 1e-05,
133
- "length_penalty": 1.0,
134
- "max_length": 20,
135
- "min_length": 0,
136
- "model_type": "clip_vision_model",
137
- "no_repeat_ngram_size": 0,
138
- "num_attention_heads": 16,
139
- "num_beam_groups": 1,
140
- "num_beams": 1,
141
- "num_channels": 3,
142
- "num_hidden_layers": 24,
143
- "num_return_sequences": 1,
144
- "output_attentions": false,
145
- "output_hidden_states": false,
146
- "output_scores": false,
147
- "pad_token_id": null,
148
- "patch_size": 14,
149
- "prefix": null,
150
- "problem_type": null,
151
- "pruned_heads": {},
152
- "remove_invalid_values": false,
153
- "repetition_penalty": 1.0,
154
- "return_dict": true,
155
- "return_dict_in_generate": false,
156
- "sep_token_id": null,
157
- "suppress_tokens": null,
158
- "task_specific_params": null,
159
- "temperature": 1.0,
160
- "tf_legacy_loss": false,
161
- "tie_encoder_decoder": false,
162
- "tie_word_embeddings": true,
163
- "tokenizer_class": null,
164
- "top_k": 50,
165
- "top_p": 1.0,
166
- "torch_dtype": null,
167
- "torchscript": false,
168
- "transformers_version": "4.24.0",
169
- "typical_p": 1.0,
170
- "use_bfloat16": false
171
- },
172
- "vision_config_dict": {
173
- "hidden_size": 1024,
174
- "intermediate_size": 4096,
175
- "num_attention_heads": 16,
176
- "num_hidden_layers": 24,
177
- "patch_size": 14
178
- }
179
- }
 
scheduler/scheduler_config.json DELETED
@@ -1,12 +0,0 @@
1
- {
2
- "_class_name": "PNDMScheduler",
3
- "_diffusers_version": "0.8.0.dev0",
4
- "beta_end": 0.012,
5
- "beta_schedule": "scaled_linear",
6
- "beta_start": 0.00085,
7
- "num_train_timesteps": 1000,
8
- "set_alpha_to_one": false,
9
- "skip_prk_steps": true,
10
- "steps_offset": 1,
11
- "trained_betas": null
12
- }
 
text_encoder/config.json DELETED
@@ -1,25 +0,0 @@
1
- {
2
- "_name_or_path": "openai/clip-vit-large-patch14",
3
- "architectures": [
4
- "CLIPTextModel"
5
- ],
6
- "attention_dropout": 0.0,
7
- "bos_token_id": 0,
8
- "dropout": 0.0,
9
- "eos_token_id": 2,
10
- "hidden_act": "quick_gelu",
11
- "hidden_size": 768,
12
- "initializer_factor": 1.0,
13
- "initializer_range": 0.02,
14
- "intermediate_size": 3072,
15
- "layer_norm_eps": 1e-05,
16
- "max_position_embeddings": 77,
17
- "model_type": "clip_text_model",
18
- "num_attention_heads": 12,
19
- "num_hidden_layers": 12,
20
- "pad_token_id": 1,
21
- "projection_dim": 768,
22
- "torch_dtype": "float32",
23
- "transformers_version": "4.24.0",
24
- "vocab_size": 49408
25
- }
 
tokenizer/merges.txt DELETED
The diff for this file is too large to render. See raw diff
 
tokenizer/special_tokens_map.json DELETED
@@ -1,24 +0,0 @@
1
- {
2
- "bos_token": {
3
- "content": "<|startoftext|>",
4
- "lstrip": false,
5
- "normalized": true,
6
- "rstrip": false,
7
- "single_word": false
8
- },
9
- "eos_token": {
10
- "content": "<|endoftext|>",
11
- "lstrip": false,
12
- "normalized": true,
13
- "rstrip": false,
14
- "single_word": false
15
- },
16
- "pad_token": "<|endoftext|>",
17
- "unk_token": {
18
- "content": "<|endoftext|>",
19
- "lstrip": false,
20
- "normalized": true,
21
- "rstrip": false,
22
- "single_word": false
23
- }
24
- }
 
tokenizer/tokenizer_config.json DELETED
@@ -1,34 +0,0 @@
1
- {
2
- "add_prefix_space": false,
3
- "bos_token": {
4
- "__type": "AddedToken",
5
- "content": "<|startoftext|>",
6
- "lstrip": false,
7
- "normalized": true,
8
- "rstrip": false,
9
- "single_word": false
10
- },
11
- "do_lower_case": true,
12
- "eos_token": {
13
- "__type": "AddedToken",
14
- "content": "<|endoftext|>",
15
- "lstrip": false,
16
- "normalized": true,
17
- "rstrip": false,
18
- "single_word": false
19
- },
20
- "errors": "replace",
21
- "model_max_length": 77,
22
- "name_or_path": "openai/clip-vit-large-patch14",
23
- "pad_token": "<|endoftext|>",
24
- "special_tokens_map_file": "./special_tokens_map.json",
25
- "tokenizer_class": "CLIPTokenizer",
26
- "unk_token": {
27
- "__type": "AddedToken",
28
- "content": "<|endoftext|>",
29
- "lstrip": false,
30
- "normalized": true,
31
- "rstrip": false,
32
- "single_word": false
33
- }
34
- }
 
tokenizer/vocab.json DELETED
The diff for this file is too large to render. See raw diff
 
unet/config.json DELETED
@@ -1,36 +0,0 @@
1
- {
2
- "_class_name": "UNet2DConditionModel",
3
- "_diffusers_version": "0.8.0.dev0",
4
- "act_fn": "silu",
5
- "attention_head_dim": 8,
6
- "block_out_channels": [
7
- 320,
8
- 640,
9
- 1280,
10
- 1280
11
- ],
12
- "center_input_sample": false,
13
- "cross_attention_dim": 768,
14
- "down_block_types": [
15
- "CrossAttnDownBlock2D",
16
- "CrossAttnDownBlock2D",
17
- "CrossAttnDownBlock2D",
18
- "DownBlock2D"
19
- ],
20
- "downsample_padding": 1,
21
- "flip_sin_to_cos": true,
22
- "freq_shift": 0,
23
- "in_channels": 4,
24
- "layers_per_block": 2,
25
- "mid_block_scale_factor": 1,
26
- "norm_eps": 1e-05,
27
- "norm_num_groups": 32,
28
- "out_channels": 4,
29
- "sample_size": 32,
30
- "up_block_types": [
31
- "UpBlock2D",
32
- "CrossAttnUpBlock2D",
33
- "CrossAttnUpBlock2D",
34
- "CrossAttnUpBlock2D"
35
- ]
36
- }
 
vae/config.json DELETED
@@ -1,29 +0,0 @@
1
- {
2
- "_class_name": "AutoencoderKL",
3
- "_diffusers_version": "0.8.0.dev0",
4
- "act_fn": "silu",
5
- "block_out_channels": [
6
- 128,
7
- 256,
8
- 512,
9
- 512
10
- ],
11
- "down_block_types": [
12
- "DownEncoderBlock2D",
13
- "DownEncoderBlock2D",
14
- "DownEncoderBlock2D",
15
- "DownEncoderBlock2D"
16
- ],
17
- "in_channels": 3,
18
- "latent_channels": 4,
19
- "layers_per_block": 2,
20
- "norm_num_groups": 32,
21
- "out_channels": 3,
22
- "sample_size": 256,
23
- "up_block_types": [
24
- "UpDecoderBlock2D",
25
- "UpDecoderBlock2D",
26
- "UpDecoderBlock2D",
27
- "UpDecoderBlock2D"
28
- ]
29
- }