Upload 15 files

Browse files

Files changed (15) hide show

README.md +135 -0
SDArt_catstravaganza.ckpt +3 -0
dataset.zip +3 -0
model_index.json +33 -0
scheduler/scheduler_config.json +16 -0
text_encoder/config.json +25 -0
text_encoder/pytorch_model.bin +3 -0
tokenizer/merges.txt +0 -0
tokenizer/special_tokens_map.json +24 -0
tokenizer/tokenizer_config.json +35 -0
tokenizer/vocab.json +0 -0
unet/config.json +51 -0
unet/diffusion_pytorch_model.bin +3 -0
vae/config.json +31 -0
vae/diffusion_pytorch_model.bin +3 -0

README.md CHANGED Viewed

@@ -1,3 +1,138 @@
 ---
 license: creativeml-openrail-m
 ---

 ---
+language:
+- en
 license: creativeml-openrail-m
+thumbnail: "https://huggingface.co/Guizmus/SDArt_Catstravaganza/resolve/main/showcase.jpg"
+tags:
+- stable-diffusion
+- text-to-image
+- image-to-image
 ---
+# PoW : CATSTRAVAGANZA !
+![Showcase](https://huggingface.co/Guizmus/SDArt_Catstravaganza/resolve/main/showcase.jpg)
+## Theme
+> Orb-like eyes, mischief, and surprise
+> Cat paws tip-a-tap at your side
+> With their fanged teeth and coy mystique
+> They gaze up at you ever so sweet
+> Won’t you give them a treat?
+This POW’s all about our favorite feline friends wherein we are absolutely not biased. Be as creative as you can!
+Whether it be a cat made of cotton candy, a grumpy cat made of storm clouds, or just your regular degular tabby cat. Bring them all here and may the best cat reign supreme!
+“It’s raining cats and cats out here!”
+* Join us for a cat-stravaganza of fun
+* Cute and cuddly - show us your paw-sitively purrfect creations!
+* Whisk-er away your doubts and unleash your cat-titude
+* Wander into the canvas alongside your meow-gnificant feline friends!
+## Model description
+This is a model related to the "Picture of the Week" contest on Stable Diffusion discord.
+I try to make a model out of all the submission for people to continue enjoy the theme after the even, and see a little of their designs in other people's creations. The main token just changed to "SDArt", usually staying the same week to week. I balance the learning on the low side, so that it doesn't just replicate creations.
+The total dataset is made of 61 pictures. It was trained on [Stable diffusion 1.5](https://huggingface.co/runwayml/stable-diffusion-v1-5). I used [EveryDream](https://github.com/victorchall/EveryDream2trainer) to do the training, 100 total repeat per picture. The pictures were tagged using the token "SDArt", and an arbitrary token I choose for each submission. The dataset is provided below, as well as a list of usernames and their corresponding token.
+The recommended sampling is UniPC on 15 steps, CFGS 5 .
+## Trained tokens
+* SDArt
+* **User** => **Token**
+* Aiska#5641 => dyce
+* Akumetsu971#9982 => ohwx
+* alex1729#4669 => bnp
+* andrekerygma#4898 => tobi
+* balrogdx#7106 => keel
+* BeardedWhale#3993 => conv
+* beta_caojin#9338 => cov
+* bitspirit3#1653 => aten
+* Brett#2283 => stg
+* bWm_nubby#6416 => fcu
+* csunberry#0594 => nwsl
+* Dexyel#9083 => cous
+* Dreamck#2108 => gare
+* Dries#2321 => mth
+* dunkeroni#4269 => nrf
+* DylanWalker#9705 => elio
+* Eface#8250 => gani
+* Eppinette-Chi#6220 => pfa
+* espasmo#9486 => kprc
+* Guizmus#9881 => kuro
+* Havok#2933 => ndi
+* Horvallis#7915 => asot
+* Jeremy#6194 => sill
+* joachim#4676 => coyi
+* Junglerally#3955 => bsp
+* katmoget#4491 => psst
+* Kaz#5485 => sqm
+* King Pendragon#3589 => irgc
+* Kingpin#2557 => buka
+* Lenny_dV#6843 => lili
+* Lord Parfington#0012 => hns
+* marjan2k#5277 => cq
+* MooDFooD#6245 => bdg
+* mr.katkot#5949 => scd
+* Munkyfoot#7944 => byes
+* NorTroll#1798 => dany
+* Omnia#2931 => yler
+* Onusai#6441 => aroa
+* owleye#1290 => avel
+* Phaff#1970 => vaw
+* piscabo#8649 => zaki
+* qushkitti#0978 => muna
+* Redinhead#3342 => ewe
+* ResidentChiefNZ#6989 => guin
+* Sam#8080 => erra
+* SirVeggie#0230 => urd
+* Smol Sus#0909 => nasi
+* sometimes#8916 => vini
+* SpaceCypher#6900 => fbs
+* tazi#2574 => crit
+* Trash--Panda#6213 => doa
+* ulla_diffusion#0451 => mlas
+* UniversityEuphoric#3284 => ylor
+* uwyne#1111 => isch
+* vcm07#7281 => phol
+* VereVolf#5658 => vedi
+* Vil#0404 => dds
+* Wipeout#0031 => acu
+* wpatzz#5836 => pte
+* xThIsIsBoToXx#8765 => oxi
+* yamination#7634 => bsgo
+## Download links
+[SafeTensors](https://huggingface.co/Guizmus/SDArt_Catstravaganza/resolve/main/SDArt-Catstravaganza.safetensors)
+[CKPT](https://huggingface.co/Guizmus/SDArt_Catstravaganza/resolve/main/SDArt-Catstravaganza.ckpt)
+[Dataset](https://huggingface.co/Guizmus/SDArt_Catstravaganza/resolve/main/dataset.zip)
+## 🧨 Diffusers
+This model can be used just like any other Stable Diffusion model. For more information,
+please have a look at the [Stable Diffusion](https://huggingface.co/docs/diffusers/api/pipelines/stable_diffusion).
+You can also export the model to [ONNX](https://huggingface.co/docs/diffusers/optimization/onnx), [MPS](https://huggingface.co/docs/diffusers/optimization/mps) and/or [FLAX/JAX]().
+```python
+from diffusers import StableDiffusionPipeline
+import torch
+model_id = "Guizmus/SDArt_Catstravaganza"
+pipe = StableDiffusionPipeline.from_pretrained(model_id, torch_dtype=torch.float16)
+pipe = pipe.to("cuda")
+prompt = "SDArt dyce"
+image = pipe(prompt).images[0]
+image.save("./SDArt.png")
+```

SDArt_catstravaganza.ckpt ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:1af4fc3f55a7d32fab870ade39cbc1968c77bc6813fa8fbcb437ad24ef6b6098
+size 2132856622

dataset.zip ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:fc3ce847951d5d0e3b7ac235543475ad318520541709ff9c2c1aa954f71e54e7
+size 57236722

model_index.json ADDED Viewed

	@@ -0,0 +1,33 @@

+{
+  "_class_name": "StableDiffusionPipeline",
+  "_diffusers_version": "0.13.0",
+  "feature_extractor": [
+    null,
+    null
+  ],
+  "requires_safety_checker": null,
+  "safety_checker": [
+    null,
+    null
+  ],
+  "scheduler": [
+    "diffusers",
+    "DDPMScheduler"
+  ],
+  "text_encoder": [
+    "transformers",
+    "CLIPTextModel"
+  ],
+  "tokenizer": [
+    "transformers",
+    "CLIPTokenizer"
+  ],
+  "unet": [
+    "diffusers",
+    "UNet2DConditionModel"
+  ],
+  "vae": [
+    "diffusers",
+    "AutoencoderKL"
+  ]
+}

scheduler/scheduler_config.json ADDED Viewed

	@@ -0,0 +1,16 @@

+{
+  "_class_name": "DDPMScheduler",
+  "_diffusers_version": "0.13.0",
+  "beta_end": 0.012,
+  "beta_schedule": "scaled_linear",
+  "beta_start": 0.00085,
+  "clip_sample": false,
+  "clip_sample_range": 1.0,
+  "num_train_timesteps": 1000,
+  "prediction_type": "epsilon",
+  "set_alpha_to_one": false,
+  "skip_prk_steps": true,
+  "steps_offset": 1,
+  "trained_betas": null,
+  "variance_type": "fixed_small"
+}

text_encoder/config.json ADDED Viewed

	@@ -0,0 +1,25 @@

+{
+  "_name_or_path": "F:\\AI\\Data\\Diffusers\\stable-diffusion-v1-5",
+  "architectures": [
+    "CLIPTextModel"
+  ],
+  "attention_dropout": 0.0,
+  "bos_token_id": 0,
+  "dropout": 0.0,
+  "eos_token_id": 2,
+  "hidden_act": "quick_gelu",
+  "hidden_size": 768,
+  "initializer_factor": 1.0,
+  "initializer_range": 0.02,
+  "intermediate_size": 3072,
+  "layer_norm_eps": 1e-05,
+  "max_position_embeddings": 77,
+  "model_type": "clip_text_model",
+  "num_attention_heads": 12,
+  "num_hidden_layers": 12,
+  "pad_token_id": 1,
+  "projection_dim": 768,
+  "torch_dtype": "float32",
+  "transformers_version": "4.25.1",
+  "vocab_size": 49408
+}

text_encoder/pytorch_model.bin ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:c3e3ed691b4a373924e3caf02a5a960b2f0dd855f05b99add977f1f693e184ba
+size 492308087

tokenizer/merges.txt ADDED Viewed

The diff for this file is too large to render. See raw diff

tokenizer/special_tokens_map.json ADDED Viewed

	@@ -0,0 +1,24 @@

+{
+  "bos_token": {
+    "content": "<|startoftext|>",
+    "lstrip": false,
+    "normalized": true,
+    "rstrip": false,
+    "single_word": false
+  },
+  "eos_token": {
+    "content": "<|endoftext|>",
+    "lstrip": false,
+    "normalized": true,
+    "rstrip": false,
+    "single_word": false
+  },
+  "pad_token": "<|endoftext|>",
+  "unk_token": {
+    "content": "<|endoftext|>",
+    "lstrip": false,
+    "normalized": true,
+    "rstrip": false,
+    "single_word": false
+  }
+}

tokenizer/tokenizer_config.json ADDED Viewed

	@@ -0,0 +1,35 @@

+{
+  "add_prefix_space": false,
+  "bos_token": {
+    "__type": "AddedToken",
+    "content": "<|startoftext|>",
+    "lstrip": false,
+    "normalized": true,
+    "rstrip": false,
+    "single_word": false
+  },
+  "do_lower_case": true,
+  "eos_token": {
+    "__type": "AddedToken",
+    "content": "<|endoftext|>",
+    "lstrip": false,
+    "normalized": true,
+    "rstrip": false,
+    "single_word": false
+  },
+  "errors": "replace",
+  "model_max_length": 77,
+  "name_or_path": "F:\\AI\\Data\\Diffusers\\stable-diffusion-v1-5",
+  "pad_token": "<|endoftext|>",
+  "special_tokens_map_file": "./special_tokens_map.json",
+  "tokenizer_class": "CLIPTokenizer",
+  "unk_token": {
+    "__type": "AddedToken",
+    "content": "<|endoftext|>",
+    "lstrip": false,
+    "normalized": true,
+    "rstrip": false,
+    "single_word": false
+  },
+  "use_fast": false
+}

tokenizer/vocab.json ADDED Viewed

The diff for this file is too large to render. See raw diff

unet/config.json ADDED Viewed

	@@ -0,0 +1,51 @@

+{
+  "_class_name": "UNet2DConditionModel",
+  "_diffusers_version": "0.13.0",
+  "_name_or_path": "F:\\AI\\Data\\Diffusers\\stable-diffusion-v1-5",
+  "act_fn": "silu",
+  "attention_head_dim": 8,
+  "block_out_channels": [
+    320,
+    640,
+    1280,
+    1280
+  ],
+  "center_input_sample": false,
+  "class_embed_type": null,
+  "conv_in_kernel": 3,
+  "conv_out_kernel": 3,
+  "cross_attention_dim": 768,
+  "down_block_types": [
+    "CrossAttnDownBlock2D",
+    "CrossAttnDownBlock2D",
+    "CrossAttnDownBlock2D",
+    "DownBlock2D"
+  ],
+  "downsample_padding": 1,
+  "dual_cross_attention": false,
+  "flip_sin_to_cos": true,
+  "freq_shift": 0,
+  "in_channels": 4,
+  "layers_per_block": 2,
+  "mid_block_scale_factor": 1,
+  "mid_block_type": "UNetMidBlock2DCrossAttn",
+  "norm_eps": 1e-05,
+  "norm_num_groups": 32,
+  "num_class_embeds": null,
+  "only_cross_attention": false,
+  "out_channels": 4,
+  "projection_class_embeddings_input_dim": null,
+  "resnet_time_scale_shift": "default",
+  "sample_size": 64,
+  "time_cond_proj_dim": null,
+  "time_embedding_type": "positional",
+  "timestep_post_act": null,
+  "up_block_types": [
+    "UpBlock2D",
+    "CrossAttnUpBlock2D",
+    "CrossAttnUpBlock2D",
+    "CrossAttnUpBlock2D"
+  ],
+  "upcast_attention": false,
+  "use_linear_projection": false
+}

unet/diffusion_pytorch_model.bin ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:00d7da30a574dcff216c42527b1c80af73fad0e92ed2e2afb5c6fe5c84ed1c5e
+size 3438364325

vae/config.json ADDED Viewed

	@@ -0,0 +1,31 @@

+{
+  "_class_name": "AutoencoderKL",
+  "_diffusers_version": "0.13.0",
+  "_name_or_path": "F:\\AI\\Data\\Diffusers\\stable-diffusion-v1-5",
+  "act_fn": "silu",
+  "block_out_channels": [
+    128,
+    256,
+    512,
+    512
+  ],
+  "down_block_types": [
+    "DownEncoderBlock2D",
+    "DownEncoderBlock2D",
+    "DownEncoderBlock2D",
+    "DownEncoderBlock2D"
+  ],
+  "in_channels": 3,
+  "latent_channels": 4,
+  "layers_per_block": 2,
+  "norm_num_groups": 32,
+  "out_channels": 3,
+  "sample_size": 512,
+  "scaling_factor": 0.18215,
+  "up_block_types": [
+    "UpDecoderBlock2D",
+    "UpDecoderBlock2D",
+    "UpDecoderBlock2D",
+    "UpDecoderBlock2D"
+  ]
+}

vae/diffusion_pytorch_model.bin ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:eb128b1f37e0c381c440128b217d29613b3e08b9e4ea7f20466424145ba538b0
+size 167402961