Kardbord committed
Commit: dc85ca0
Parent: fbaa942

Upload folder using huggingface_hub (#1)


- 97d68c49ee781b9ee8669f4cc50c79bd2c15ce0d5dcae7e28bc76d46a28559f9 (b32ef921a83cd3451c9eecffaa0a67c785deb089)
- 4b9aea23b42c1971a95dd51cb2975abf11d9f31c37edba35fd79525f7f85645c (dd24eb13f1f4a76ae007b0d963b89e738559828c)
- 46d2187ce299b2a916ca2e0c36715a7739d2b6e272396f19b47127dd7c8ebd78 (c163d6359eccc89da60df375e3d2e775a3322254)
- 5a9879665e7c5baae2f24679deec50f5d95b11d1df79dd85cfacc21260abbc58 (d58f48f49de661c4716fbe59f67dea6b6ecb3906)
- 0fd121886c7477d5bfb932d60e3a251c871b8c63b591fb57040bf0eaef4072c2 (04d153f8ea0de4f75d9aef4459c0d1966d8eeba1)
- 404aac17a0336138e7d27f0e8f443d82a4254559d77e0dfc7ca1e54209c1b4d0 (6c9f99a9b5e90ce50a68a26080682baf143b0197)

Model Weights.png ADDED
ProtoGen_X3.4-pruned-fp16.ckpt ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:5346d7de1f448e6953a12e9c186f3996ac07b6e1ea6076fc242bc484b48b7c95
+ size 1886665781
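The large binary files in this commit are stored as Git LFS pointers like the block above: three `key value` lines giving the spec version, the content hash, and the byte size. A minimal sketch of parsing one (stdlib only; the helper name is an illustration, not part of any tool here):

```python
# Parse a Git LFS pointer file (version / oid / size lines) into a dict.
def parse_lfs_pointer(text: str) -> dict:
    """Split each non-empty line on the first space into key/value pairs."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    # size is numeric; oid carries its hash algorithm as a prefix
    fields["size"] = int(fields["size"])
    algo, _, digest = fields["oid"].partition(":")
    fields["oid_algo"], fields["oid_hex"] = algo, digest
    return fields

pointer = """\
version https://git-lfs.github.com/spec/v1
oid sha256:5346d7de1f448e6953a12e9c186f3996ac07b6e1ea6076fc242bc484b48b7c95
size 1886665781
"""
info = parse_lfs_pointer(pointer)
print(info["oid_algo"], info["size"])  # sha256 1886665781
```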
ProtoGen_X3.4-pruned-fp16.safetensors ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:ef8629e2c89e19a993146302418cf1ff3ba0384dd98523eab6b88ac33ead9d39
+ size 1886474920
ProtoGen_X3.4.ckpt ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:61a37adf761fbbf4cb3d88da480216341113d0fbcf9f0f77ea71863866a9b6fc
+ size 5984615834
ProtoGen_X3.4.safetensors ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:44f90a09727ca8b62ea304e140546a0af96ba6edcb229c20c677aa4460449c21
+ size 5984232961
Protogen_x3.4-512.png ADDED
README.md ADDED
@@ -0,0 +1,365 @@
+ ---
+ language:
+ - en
+ license: creativeml-openrail-m
+ tags:
+ - stable-diffusion
+ - stable-diffusion-diffusers
+ - text-to-image
+ - art
+ - artistic
+ - diffusers
+ - protogen
+ inference: true
+ ---
+ # Overview
+ 
+ This is simply darkstorm2150/Protogen_x3.4_Official_Release with the safety checker disabled.
+ 
+ **DO NOT** attempt to use this model to generate harmful or illegal content.
+ 
+ <center><img src="https://huggingface.co/darkstorm2150/Protogen_x3.4_Official_Release/resolve/main/Protogen_x3.4-512.png" style="height:690px; border-radius: 8%; border: 10px solid #663380; padding-top:0px;" title="Protogen x3.4 Raw Output"></center>
+ 
+ <center><h1>Protogen x3.4 (Photorealism) Official Release</h1></center>
+ <center><p><em>Research Model by <a href="https://instagram.com/officialvictorespinoza">darkstorm2150</a></em></p></center>
+ 
+ ## Table of contents
+ * [General info](#general-info)
+ * [Granular Adaptive Learning](#granular-adaptive-learning)
+ * [Trigger Words](#trigger-words)
+ * [Setup](#setup)
+ * [Space](#space)
+ * [CompVis](#compvis)
+ * [Diffusers](#🧨-diffusers)
+ * [Checkpoint Merging Data Reference](#checkpoint-merging-data-reference)
+ * [License](#license)
+ 
+ ## General info
+ Protogen x3.4
+ 
+ Protogen was warm-started with [Stable Diffusion v1-5](https://huggingface.co/runwayml/stable-diffusion-v1-5) and fine-tuned on various high-quality image datasets.
+ Version 3.4 continued training from [ProtoGen v2.2](https://huggingface.co/darkstorm2150/Protogen_v2.2_Official_Release) with added photorealism.
+ 
+ ## Granular Adaptive Learning
+ 
+ Granular adaptive learning is a machine learning technique that adjusts the learning process at a fine-grained level rather than making global adjustments to the model. This approach lets the model adapt to specific patterns or features in the data instead of relying on general trends.
+ 
+ Granular adaptive learning can be achieved through active learning, where the model selects the data it wants to learn from; through reinforcement learning, where the model receives feedback on its performance and adapts accordingly; or through online learning, where the model adjusts itself as it receives more data.
+ 
+ Granular adaptive learning is often used where the data is highly diverse or non-stationary and the model must adapt quickly to changing patterns, as in dynamic environments such as robotics, financial markets, and natural language processing.
+ 
+ ## Trigger Words
+ 
+ modelshoot style, analog style, mdjrny-v4 style, nousr robot
+ 
+ Trigger words also exist for the hassan1.4 and f222 components; you may have to search for them.
+ 
+ ## Setup
+ To run this model, download one of the .ckpt or .safetensors files below and place it in your "stable-diffusion-webui\models\Stable-diffusion" directory.
+ 
+ ## Space
+ 
+ We support a [Gradio](https://github.com/gradio-app/gradio) Web UI:
+ [![Open In Spaces](https://camo.githubusercontent.com/00380c35e60d6b04be65d3d94a58332be5cc93779f630bcdfc18ab9a3a7d3388/68747470733a2f2f696d672e736869656c64732e696f2f62616467652f25463025394625413425393725323048756767696e67253230466163652d5370616365732d626c7565)](https://huggingface.co/spaces/darkstorm2150/Stable-Diffusion-Protogen-webui)
+ 
+ ### CompVis
+ 
+ #### CKPT
+ [Download ProtoGen x3.4.ckpt (5.98 GB)](https://huggingface.co/darkstorm2150/Protogen_x3.4_Official_Release/resolve/main/ProtoGen_X3.4.ckpt)
+ 
+ [Download ProtoGen X3.4-pruned-fp16.ckpt (1.89 GB)](https://huggingface.co/darkstorm2150/Protogen_x3.4_Official_Release/resolve/main/ProtoGen_X3.4-pruned-fp16.ckpt)
+ 
+ #### Safetensors
+ [Download ProtoGen x3.4.safetensors (5.98 GB)](https://huggingface.co/darkstorm2150/Protogen_x3.4_Official_Release/resolve/main/ProtoGen_X3.4.safetensors)
+ 
+ [Download ProtoGen x3.4-pruned-fp16.safetensors (1.89 GB)](https://huggingface.co/darkstorm2150/Protogen_x3.4_Official_Release/resolve/main/ProtoGen_X3.4-pruned-fp16.safetensors)
+ 
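Since each checkpoint in this commit ships with an LFS pointer recording a sha256 oid, a downloaded file can be verified against that digest before use. A sketch with the standard library only; the local file path is hypothetical, and the digest shown is the oid listed for ProtoGen_X3.4-pruned-fp16.safetensors:

```python
# Verify a downloaded checkpoint against the sha256 oid from its LFS pointer.
import hashlib

def sha256_of(path: str, chunk_size: int = 1 << 20) -> str:
    """Stream the file in 1 MiB chunks so multi-GB checkpoints
    are never loaded into memory at once."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()

EXPECTED = "ef8629e2c89e19a993146302418cf1ff3ba0384dd98523eab6b88ac33ead9d39"
# Hypothetical usage after downloading:
# if sha256_of("ProtoGen_X3.4-pruned-fp16.safetensors") != EXPECTED:
#     raise ValueError("checksum mismatch - re-download the file")
```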
+ ### 🧨 Diffusers
+ 
+ This model can be used just like any other Stable Diffusion model. For more information,
+ please have a look at the [Stable Diffusion Pipeline](https://huggingface.co/docs/diffusers/api/pipelines/stable_diffusion).
+ 
+ ```python
+ from diffusers import StableDiffusionPipeline, DPMSolverMultistepScheduler
+ import torch
+ 
+ prompt = (
+     "modelshoot style, (extremely detailed CG unity 8k wallpaper), full shot body photo of the most beautiful artwork in the world, "
+     "english medieval witch, black silk vale, pale skin, black silk robe, black cat, necromancy magic, medieval era, "
+     "photorealistic painting by Ed Blinkey, Atey Ghailan, Studio Ghibli, by Jeremy Mann, Greg Manchess, Antonio Moro, trending on ArtStation, "
+     "trending on CGSociety, Intricate, High Detail, Sharp focus, dramatic, photorealistic painting art by midjourney and greg rutkowski"
+ )
+ 
+ model_id = "darkstorm2150/Protogen_x3.4_Official_Release"
+ # Load the pipeline in fp16 and swap in the DPM-Solver++ multistep scheduler
+ pipe = StableDiffusionPipeline.from_pretrained(model_id, torch_dtype=torch.float16)
+ pipe.scheduler = DPMSolverMultistepScheduler.from_config(pipe.scheduler.config)
+ pipe = pipe.to("cuda")
+ 
+ image = pipe(prompt, num_inference_steps=25).images[0]
+ 
+ image.save("./result.jpg")
+ ```
+ 
+ ![img](https://huggingface.co/datasets/patrickvonplaten/images/resolve/main/protogen/rswf5qk9be9a1.jpg)
+ 
+ ## Checkpoint Merging Data Reference
+ 
+ *Note: merge data is still pending; RPGv2 is not yet accounted for.*
+ 
+ | Models | Protogen v2.2 (Anime) | Protogen x3.4 (Photo) | Protogen x5.3 (Photo) | Protogen x5.8 (Sci-fi/Anime) | Protogen x5.9 (Dragon) | Protogen x7.4 (Eclipse) | Protogen x8.0 (Nova) | Protogen x8.6 (Infinity) |
+ |---|---|---|---|---|---|---|---|---|
+ | seek_art_mega v1 | 52.50% | 42.76% | 42.63% | | | | 25.21% | 14.83% |
+ | modelshoot v1 | 30.00% | 24.44% | 24.37% | 2.56% | 2.05% | 3.48% | 22.91% | 13.48% |
+ | elldreth v1 | 12.64% | 10.30% | 10.23% | | | | 6.06% | 3.57% |
+ | photoreal v2 | | | 10.00% | 48.64% | 38.91% | 66.33% | 20.49% | 12.06% |
+ | analogdiffusion v1 | | 4.75% | 4.50% | | | | 1.75% | 1.03% |
+ | openjourney v2 | | 4.51% | 4.28% | | | 4.75% | 2.26% | 1.33% |
+ | hassan1.4 | 2.63% | 2.14% | 2.13% | | | | 1.26% | 0.74% |
+ | f222 | 2.23% | 1.82% | 1.81% | | | | 1.07% | 0.63% |
+ | hasdx | | | | 20.00% | 16.00% | 4.07% | 5.01% | 2.95% |
+ | moistmix | | | | 16.00% | 12.80% | 3.86% | 4.08% | 2.40% |
+ | roboDiffusion v1 | | 4.29% | | 12.80% | 10.24% | 3.67% | 4.41% | 2.60% |
+ | RPG v3 | | 5.00% | | | 20.00% | 4.29% | 4.29% | 2.52% |
+ | anything&everything | | | | | | 4.51% | 0.56% | 0.33% |
+ | dreamlikediff v1 | | | | | | 5.00% | 0.63% | 0.37% |
+ | sci-fidiff v1 | | | | | | | | 3.10% |
+ | synthwavepunk v2 | | | | | | | | 3.26% |
+ | mashupv2 | | | | | | | | 11.51% |
+ | dreamshaper 252 | | | | | | | | 4.04% |
+ | comicdiff v2 | | | | | | | | 4.25% |
+ | artEros | | | | | | | | 15.00% |
+ 
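The merge table above can be sanity-checked: the weights listed for the Protogen x3.4 (Photo) column account for essentially the whole model. A small sketch summing the published figures:

```python
# Sum the Protogen x3.4 (Photo) column from the merge table above.
# Values are copied verbatim from the published table.
x34_weights = {
    "seek_art_mega v1": 42.76,
    "modelshoot v1": 24.44,
    "elldreth v1": 10.30,
    "analogdiffusion v1": 4.75,
    "openjourney v2": 4.51,
    "hassan1.4": 2.14,
    "f222": 1.82,
    "roboDiffusion v1": 4.29,
    "RPG v3": 5.00,
}
total = sum(x34_weights.values())
print(f"{total:.2f}%")  # 100.01% - the extra 0.01 is rounding in the table
```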
+ ## License
+ 
+ By downloading, you agree to the terms of these licenses:
+ 
+ <a href="https://huggingface.co/spaces/CompVis/stable-diffusion-license">CreativeML Open RAIL-M</a>
+ 
+ <a href="https://huggingface.co/coreco/seek.art_MEGA/blob/main/LICENSE.txt">Seek Art Mega License</a>
feature_extractor/preprocessor_config.json ADDED
@@ -0,0 +1,28 @@
+ {
+ "crop_size": {
+ "height": 224,
+ "width": 224
+ },
+ "do_center_crop": true,
+ "do_convert_rgb": true,
+ "do_normalize": true,
+ "do_rescale": true,
+ "do_resize": true,
+ "feature_extractor_type": "CLIPFeatureExtractor",
+ "image_mean": [
+ 0.48145466,
+ 0.4578275,
+ 0.40821073
+ ],
+ "image_processor_type": "CLIPFeatureExtractor",
+ "image_std": [
+ 0.26862954,
+ 0.26130258,
+ 0.27577711
+ ],
+ "resample": 3,
+ "rescale_factor": 0.00392156862745098,
+ "size": {
+ "shortest_edge": 224
+ }
+ }
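The preprocessor config above describes standard CLIP image preprocessing: pixels are rescaled by `rescale_factor` (1/255) and then normalized channel-wise with `image_mean` and `image_std`. A pure-Python sketch of that normalization for a single RGB pixel (the helper name is illustrative):

```python
# Normalization as described by the feature extractor config above:
# x/255, then (x - mean) / std per channel, with the CLIP mean/std values.
IMAGE_MEAN = [0.48145466, 0.4578275, 0.40821073]
IMAGE_STD = [0.26862954, 0.26130258, 0.27577711]
RESCALE = 1 / 255  # matches "rescale_factor": 0.00392156862745098

def normalize_pixel(rgb):
    """rgb: three 0-255 channel values -> normalized floats."""
    return [
        (value * RESCALE - mean) / std
        for value, mean, std in zip(rgb, IMAGE_MEAN, IMAGE_STD)
    ]

# A mid-gray pixel lands close to zero in every channel.
print(normalize_pixel((128, 128, 128)))
```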
model_index.json ADDED
@@ -0,0 +1 @@
+ {"_class_name": "StableDiffusionPipeline", "_diffusers_version": "0.12.0.dev0", "feature_extractor": ["transformers", "CLIPImageProcessor"], "requires_safety_checker": false, "safety_checker": [null, null], "scheduler": ["diffusers", "PNDMScheduler"], "text_encoder": ["transformers", "CLIPTextModel"], "tokenizer": ["transformers", "CLIPTokenizer"], "unet": ["diffusers", "UNet2DConditionModel"], "vae": ["diffusers", "AutoencoderKL"]}
safety_checker/config.json ADDED
@@ -0,0 +1,181 @@
+ {
+ "_commit_hash": "cb41f3a270d63d454d385fc2e4f571c487c253c5",
+ "_name_or_path": "CompVis/stable-diffusion-safety-checker",
+ "architectures": [
+ "StableDiffusionSafetyChecker"
+ ],
+ "initializer_factor": 1.0,
+ "logit_scale_init_value": 2.6592,
+ "model_type": "clip",
+ "projection_dim": 768,
+ "text_config": {
+ "_name_or_path": "",
+ "add_cross_attention": false,
+ "architectures": null,
+ "attention_dropout": 0.0,
+ "bad_words_ids": null,
+ "begin_suppress_tokens": null,
+ "bos_token_id": 0,
+ "chunk_size_feed_forward": 0,
+ "cross_attention_hidden_size": null,
+ "decoder_start_token_id": null,
+ "diversity_penalty": 0.0,
+ "do_sample": false,
+ "dropout": 0.0,
+ "early_stopping": false,
+ "encoder_no_repeat_ngram_size": 0,
+ "eos_token_id": 2,
+ "exponential_decay_length_penalty": null,
+ "finetuning_task": null,
+ "forced_bos_token_id": null,
+ "forced_eos_token_id": null,
+ "hidden_act": "quick_gelu",
+ "hidden_size": 768,
+ "id2label": {
+ "0": "LABEL_0",
+ "1": "LABEL_1"
+ },
+ "initializer_factor": 1.0,
+ "initializer_range": 0.02,
+ "intermediate_size": 3072,
+ "is_decoder": false,
+ "is_encoder_decoder": false,
+ "label2id": {
+ "LABEL_0": 0,
+ "LABEL_1": 1
+ },
+ "layer_norm_eps": 1e-05,
+ "length_penalty": 1.0,
+ "max_length": 20,
+ "max_position_embeddings": 77,
+ "min_length": 0,
+ "model_type": "clip_text_model",
+ "no_repeat_ngram_size": 0,
+ "num_attention_heads": 12,
+ "num_beam_groups": 1,
+ "num_beams": 1,
+ "num_hidden_layers": 12,
+ "num_return_sequences": 1,
+ "output_attentions": false,
+ "output_hidden_states": false,
+ "output_scores": false,
+ "pad_token_id": 1,
+ "prefix": null,
+ "problem_type": null,
+ "projection_dim": 512,
+ "pruned_heads": {},
+ "remove_invalid_values": false,
+ "repetition_penalty": 1.0,
+ "return_dict": true,
+ "return_dict_in_generate": false,
+ "sep_token_id": null,
+ "suppress_tokens": null,
+ "task_specific_params": null,
+ "temperature": 1.0,
+ "tf_legacy_loss": false,
+ "tie_encoder_decoder": false,
+ "tie_word_embeddings": true,
+ "tokenizer_class": null,
+ "top_k": 50,
+ "top_p": 1.0,
+ "torch_dtype": null,
+ "torchscript": false,
+ "transformers_version": "4.26.0.dev0",
+ "typical_p": 1.0,
+ "use_bfloat16": false,
+ "vocab_size": 49408
+ },
+ "text_config_dict": {
+ "hidden_size": 768,
+ "intermediate_size": 3072,
+ "num_attention_heads": 12,
+ "num_hidden_layers": 12
+ },
+ "torch_dtype": "float32",
+ "transformers_version": null,
+ "vision_config": {
+ "_name_or_path": "",
+ "add_cross_attention": false,
+ "architectures": null,
+ "attention_dropout": 0.0,
+ "bad_words_ids": null,
+ "begin_suppress_tokens": null,
+ "bos_token_id": null,
+ "chunk_size_feed_forward": 0,
+ "cross_attention_hidden_size": null,
+ "decoder_start_token_id": null,
+ "diversity_penalty": 0.0,
+ "do_sample": false,
+ "dropout": 0.0,
+ "early_stopping": false,
+ "encoder_no_repeat_ngram_size": 0,
+ "eos_token_id": null,
+ "exponential_decay_length_penalty": null,
+ "finetuning_task": null,
+ "forced_bos_token_id": null,
+ "forced_eos_token_id": null,
+ "hidden_act": "quick_gelu",
+ "hidden_size": 1024,
+ "id2label": {
+ "0": "LABEL_0",
+ "1": "LABEL_1"
+ },
+ "image_size": 224,
+ "initializer_factor": 1.0,
+ "initializer_range": 0.02,
+ "intermediate_size": 4096,
+ "is_decoder": false,
+ "is_encoder_decoder": false,
+ "label2id": {
+ "LABEL_0": 0,
+ "LABEL_1": 1
+ },
+ "layer_norm_eps": 1e-05,
+ "length_penalty": 1.0,
+ "max_length": 20,
+ "min_length": 0,
+ "model_type": "clip_vision_model",
+ "no_repeat_ngram_size": 0,
+ "num_attention_heads": 16,
+ "num_beam_groups": 1,
+ "num_beams": 1,
+ "num_channels": 3,
+ "num_hidden_layers": 24,
+ "num_return_sequences": 1,
+ "output_attentions": false,
+ "output_hidden_states": false,
+ "output_scores": false,
+ "pad_token_id": null,
+ "patch_size": 14,
+ "prefix": null,
+ "problem_type": null,
+ "projection_dim": 512,
+ "pruned_heads": {},
+ "remove_invalid_values": false,
+ "repetition_penalty": 1.0,
+ "return_dict": true,
+ "return_dict_in_generate": false,
+ "sep_token_id": null,
+ "suppress_tokens": null,
+ "task_specific_params": null,
+ "temperature": 1.0,
+ "tf_legacy_loss": false,
+ "tie_encoder_decoder": false,
+ "tie_word_embeddings": true,
+ "tokenizer_class": null,
+ "top_k": 50,
+ "top_p": 1.0,
+ "torch_dtype": null,
+ "torchscript": false,
+ "transformers_version": "4.26.0.dev0",
+ "typical_p": 1.0,
+ "use_bfloat16": false
+ },
+ "vision_config_dict": {
+ "hidden_size": 1024,
+ "intermediate_size": 4096,
+ "num_attention_heads": 16,
+ "num_hidden_layers": 24,
+ "patch_size": 14
+ }
+ }
safety_checker/pytorch_model.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:16d28f2b37109f222cdc33620fdd262102ac32112be0352a7f77e9614b35a394
+ size 1216064769
scheduler/scheduler_config.json ADDED
@@ -0,0 +1,14 @@
+ {
+ "_class_name": "PNDMScheduler",
+ "_diffusers_version": "0.12.0.dev0",
+ "beta_end": 0.012,
+ "beta_schedule": "scaled_linear",
+ "beta_start": 0.00085,
+ "clip_sample": false,
+ "num_train_timesteps": 1000,
+ "prediction_type": "epsilon",
+ "set_alpha_to_one": false,
+ "skip_prk_steps": true,
+ "steps_offset": 1,
+ "trained_betas": null
+ }
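The `"scaled_linear"` schedule named in the config above is, as diffusers builds it, a linear spacing between the square roots of `beta_start` and `beta_end`, squared. A dependency-free sketch (the function name is illustrative, not a diffusers API):

```python
# "scaled_linear" beta schedule: linspace(sqrt(beta_start), sqrt(beta_end)),
# then square each value. Pure Python to stay dependency-free.
def scaled_linear_betas(beta_start, beta_end, num_train_timesteps):
    lo, hi = beta_start ** 0.5, beta_end ** 0.5
    step = (hi - lo) / (num_train_timesteps - 1)
    return [(lo + i * step) ** 2 for i in range(num_train_timesteps)]

# Values from the scheduler config above.
betas = scaled_linear_betas(0.00085, 0.012, 1000)
assert abs(betas[0] - 0.00085) < 1e-12
assert abs(betas[-1] - 0.012) < 1e-12
```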
text_encoder/config.json ADDED
@@ -0,0 +1,25 @@
+ {
+ "_name_or_path": "openai/clip-vit-large-patch14",
+ "architectures": [
+ "CLIPTextModel"
+ ],
+ "attention_dropout": 0.0,
+ "bos_token_id": 0,
+ "dropout": 0.0,
+ "eos_token_id": 2,
+ "hidden_act": "quick_gelu",
+ "hidden_size": 768,
+ "initializer_factor": 1.0,
+ "initializer_range": 0.02,
+ "intermediate_size": 3072,
+ "layer_norm_eps": 1e-05,
+ "max_position_embeddings": 77,
+ "model_type": "clip_text_model",
+ "num_attention_heads": 12,
+ "num_hidden_layers": 12,
+ "pad_token_id": 1,
+ "projection_dim": 768,
+ "torch_dtype": "float32",
+ "transformers_version": "4.26.0.dev0",
+ "vocab_size": 49408
+ }
text_encoder/pytorch_model.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:562a8a1222c3e3f73b802a3c52d866f97a79325a1a3189ec2fe49e5f54bc5a7b
+ size 492307041
tokenizer/merges.txt ADDED
The diff for this file is too large to render. See raw diff
 
tokenizer/special_tokens_map.json ADDED
@@ -0,0 +1,24 @@
+ {
+ "bos_token": {
+ "content": "<|startoftext|>",
+ "lstrip": false,
+ "normalized": true,
+ "rstrip": false,
+ "single_word": false
+ },
+ "eos_token": {
+ "content": "<|endoftext|>",
+ "lstrip": false,
+ "normalized": true,
+ "rstrip": false,
+ "single_word": false
+ },
+ "pad_token": "<|endoftext|>",
+ "unk_token": {
+ "content": "<|endoftext|>",
+ "lstrip": false,
+ "normalized": true,
+ "rstrip": false,
+ "single_word": false
+ }
+ }
tokenizer/tokenizer_config.json ADDED
@@ -0,0 +1,34 @@
+ {
+ "add_prefix_space": false,
+ "bos_token": {
+ "__type": "AddedToken",
+ "content": "<|startoftext|>",
+ "lstrip": false,
+ "normalized": true,
+ "rstrip": false,
+ "single_word": false
+ },
+ "do_lower_case": true,
+ "eos_token": {
+ "__type": "AddedToken",
+ "content": "<|endoftext|>",
+ "lstrip": false,
+ "normalized": true,
+ "rstrip": false,
+ "single_word": false
+ },
+ "errors": "replace",
+ "model_max_length": 77,
+ "name_or_path": "openai/clip-vit-large-patch14",
+ "pad_token": "<|endoftext|>",
+ "special_tokens_map_file": "./special_tokens_map.json",
+ "tokenizer_class": "CLIPTokenizer",
+ "unk_token": {
+ "__type": "AddedToken",
+ "content": "<|endoftext|>",
+ "lstrip": false,
+ "normalized": true,
+ "rstrip": false,
+ "single_word": false
+ }
+ }
tokenizer/vocab.json ADDED
The diff for this file is too large to render. See raw diff
 
unet/config.json ADDED
@@ -0,0 +1,44 @@
+ {
+ "_class_name": "UNet2DConditionModel",
+ "_diffusers_version": "0.12.0.dev0",
+ "act_fn": "silu",
+ "attention_head_dim": 8,
+ "block_out_channels": [
+ 320,
+ 640,
+ 1280,
+ 1280
+ ],
+ "center_input_sample": false,
+ "class_embed_type": null,
+ "cross_attention_dim": 768,
+ "down_block_types": [
+ "CrossAttnDownBlock2D",
+ "CrossAttnDownBlock2D",
+ "CrossAttnDownBlock2D",
+ "DownBlock2D"
+ ],
+ "downsample_padding": 1,
+ "dual_cross_attention": false,
+ "flip_sin_to_cos": true,
+ "freq_shift": 0,
+ "in_channels": 4,
+ "layers_per_block": 2,
+ "mid_block_scale_factor": 1,
+ "mid_block_type": "UNetMidBlock2DCrossAttn",
+ "norm_eps": 1e-05,
+ "norm_num_groups": 32,
+ "num_class_embeds": null,
+ "only_cross_attention": false,
+ "out_channels": 4,
+ "resnet_time_scale_shift": "default",
+ "sample_size": 64,
+ "up_block_types": [
+ "UpBlock2D",
+ "CrossAttnUpBlock2D",
+ "CrossAttnUpBlock2D",
+ "CrossAttnUpBlock2D"
+ ],
+ "upcast_attention": false,
+ "use_linear_projection": false
+ }
unet/diffusion_pytorch_model.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:926c30ee1b8fb52ec8983427e9b2a23ab67ed29fab23ea5eb48c221cc331afbf
+ size 3438366373
vae/config.json ADDED
@@ -0,0 +1,30 @@
+ {
+ "_class_name": "AutoencoderKL",
+ "_diffusers_version": "0.12.0.dev0",
+ "act_fn": "silu",
+ "block_out_channels": [
+ 128,
+ 256,
+ 512,
+ 512
+ ],
+ "down_block_types": [
+ "DownEncoderBlock2D",
+ "DownEncoderBlock2D",
+ "DownEncoderBlock2D",
+ "DownEncoderBlock2D"
+ ],
+ "in_channels": 3,
+ "latent_channels": 4,
+ "layers_per_block": 2,
+ "norm_num_groups": 32,
+ "out_channels": 3,
+ "sample_size": 512,
+ "scaling_factor": 0.18215,
+ "up_block_types": [
+ "UpDecoderBlock2D",
+ "UpDecoderBlock2D",
+ "UpDecoderBlock2D",
+ "UpDecoderBlock2D"
+ ]
+ }
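The VAE and UNet configs in this commit fit together: the AutoencoderKL downsamples once per encoder stage except the last, so with four `block_out_channels` entries the spatial factor is 2^(4-1) = 8, turning 512-px images into 64x64 latents with 4 channels, which matches the UNet's `sample_size` of 64 and `in_channels` of 4. A small cross-check sketch (values copied from the configs; the downsampling rule is standard Stable Diffusion VAE behavior):

```python
# Cross-check the VAE config above against the UNet config in this commit.
vae = {"block_out_channels": [128, 256, 512, 512], "sample_size": 512,
       "latent_channels": 4}
unet = {"sample_size": 64, "in_channels": 4}

# One 2x downsampling per encoder stage except the last one.
downsample_factor = 2 ** (len(vae["block_out_channels"]) - 1)
latent_size = vae["sample_size"] // downsample_factor
print(downsample_factor, latent_size)  # 8 64

assert latent_size == unet["sample_size"]
assert vae["latent_channels"] == unet["in_channels"]
```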
vae/diffusion_pytorch_model.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:3e9214a656c2445a921065a40861f6adfbe0aa8e0219785e5866f9eef0d5716f
+ size 334711857