commit files to HF hub

Browse files

Files changed (15) hide show

README.md +72 -0
feature_extractor/preprocessor_config.json +28 -0
inference.py +18 -0
model_index.json +32 -0
scheduler/scheduler_config.json +14 -0
text_encoder/openvino_model.bin +3 -0
text_encoder/openvino_model.xml +0 -0
tokenizer/merges.txt +0 -0
tokenizer/special_tokens_map.json +24 -0
tokenizer/tokenizer_config.json +34 -0
tokenizer/vocab.json +0 -0
unet/openvino_model.bin +3 -0
unet/openvino_model.xml +0 -0
vae_decoder/openvino_model.bin +3 -0
vae_decoder/openvino_model.xml +0 -0

README.md ADDED Viewed

	@@ -0,0 +1,72 @@

+---
+license: creativeml-openrail-m
+tags:
+- stable-diffusion
+- text-to-image
+- openvino
+extra_gated_prompt: |-
+ This model is open access and available to all, with a CreativeML OpenRAIL-M license further specifying rights and usage.
+ The CreativeML OpenRAIL License specifies:
+ 1. You can't use the model to deliberately produce nor share illegal or harmful outputs or content
+ 2. The authors claim no rights on the outputs you generate, you are free to use them and are accountable for their use which must not go against the provisions set in the license
+ 3. You may re-distribute the weights and use the model commercially and/or as a service. If you do, please be aware you have to include the same use restrictions as the ones in the license and share a copy of the CreativeML OpenRAIL-M to all your users (please read the license entirely and carefully)
+ Please read the full license carefully here: https://huggingface.co/spaces/CompVis/stable-diffusion-license
+extra_gated_heading: Please read the LICENSE to access this model
+---
+# OpenVINO Stable Diffusion
+## CompVis/stable-diffusion-v1-4
+This repository contains the models from [CompVis/stable-diffusion-v1-4](https://huggingface.co/CompVis/stable-diffusion-v1-4) converted to
+OpenVINO, for accelerated inference on CPU or Intel GPU with OpenVINO's integration into Optimum:
+[optimum-intel](https://github.com/huggingface/optimum-intel#openvino). The model weights are stored with FP16
+precision, which reduces the size of the model by half.
+Please check out the [source model repository](https://huggingface.co/CompVis/stable-diffusion-v1-4) for more information about the model and its license.
+To install the requirements for this demo, do `pip install optimum[openvino]`. This installs all the necessary dependencies,
+including Transformers and OpenVINO. For more detailed steps, please see this [installation guide](https://github.com/helena-intel/optimum-intel/wiki/OpenVINO-Integration-Installation-Guide).
+The simplest way to generate an image with stable diffusion takes only two lines of code, as shown below. The first line downloads the
+model from the Hugging Face hub (if it has not been downloaded before) and loads it; the second line generates an image.
+```python
+from optimum.intel.openvino import OVStableDiffusionPipeline
+stable_diffusion = OVStableDiffusionPipeline.from_pretrained("CompVis/stable-diffusion-v1-4")
+images = stable_diffusion("a random image").images
+```
+The following example code uses static shapes for even faster inference. Using larger image sizes will
+require more memory and take longer to generate.
+If you have an 11th generation or later Intel Core processor, you can use the integrated GPU for inference, and if you have an Intel
+discrete GPU, you can use that. Add the line `stable_diffusion.to("GPU")` before `stable_diffusion.compile()` in the example below.
+Model loading will take some time the first time, but will be faster after that, because the model will be cached. On GPU, for stable
+diffusion only static shapes are supported at the moment.
+```python
+from optimum.intel.openvino.modeling_diffusion import OVStableDiffusionPipeline
+batch_size = 1
+num_images_per_prompt = 1
+height = 256
+width = 256
+# load the model and reshape to static shapes for faster inference
+model_id = "CompVis/stable-diffusion-v1-4"
+stable_diffusion = OVStableDiffusionPipeline.from_pretrained(model_id, compile=False)
+stable_diffusion.reshape( batch_size=batch_size, height=height, width=width, num_images_per_prompt=num_images_per_prompt)
+stable_diffusion.compile()
+# generate image!
+prompt = "a random image"
+images = stable_diffusion(prompt, height=height, width=width, num_images_per_prompt=num_images_per_prompt).images
+images[0].save("result.png")
+```

feature_extractor/preprocessor_config.json ADDED Viewed

	@@ -0,0 +1,28 @@

+{
+  "crop_size": {
+    "height": 224,
+    "width": 224
+  },
+  "do_center_crop": true,
+  "do_convert_rgb": true,
+  "do_normalize": true,
+  "do_rescale": true,
+  "do_resize": true,
+  "feature_extractor_type": "CLIPFeatureExtractor",
+  "image_mean": [
+    0.48145466,
+    0.4578275,
+    0.40821073
+  ],
+  "image_processor_type": "CLIPFeatureExtractor",
+  "image_std": [
+    0.26862954,
+    0.26130258,
+    0.27577711
+  ],
+  "resample": 3,
+  "rescale_factor": 0.00392156862745098,
+  "size": {
+    "shortest_edge": 224
+  }
+}

inference.py ADDED Viewed

	@@ -0,0 +1,18 @@

+from optimum.intel.openvino.modeling_diffusion import OVStableDiffusionPipeline
+batch_size = 1
+num_images_per_prompt = 1
+height = 256
+width = 256
+# load the model and reshape to static shapes for faster inference
+model_id = "helenai/CompVis-stable-diffusion-v1-4-ov"
+stable_diffusion = OVStableDiffusionPipeline.from_pretrained(model_id, compile=False)
+stable_diffusion.reshape( batch_size=batch_size, height=height, width=width, num_images_per_prompt=num_images_per_prompt)
+stable_diffusion.compile()
+# generate image!
+prompt = "a random image"
+images = stable_diffusion(prompt, height=height, width=width, num_images_per_prompt=num_images_per_prompt).images
+images[0].save("result.png")

model_index.json ADDED Viewed

	@@ -0,0 +1,32 @@

+{
+  "_class_name": "OVStableDiffusionPipeline",
+  "_diffusers_version": "0.13.1",
+  "feature_extractor": [
+    "transformers",
+    "CLIPFeatureExtractor"
+  ],
+  "safety_checker": [
+    "stable_diffusion",
+    "StableDiffusionSafetyChecker"
+  ],
+  "scheduler": [
+    "diffusers",
+    "PNDMScheduler"
+  ],
+  "text_encoder": [
+    "optimum",
+    "OVModelTextEncoder"
+  ],
+  "tokenizer": [
+    "transformers",
+    "CLIPTokenizer"
+  ],
+  "unet": [
+    "optimum",
+    "OVModelUnet"
+  ],
+  "vae_decoder": [
+    "optimum",
+    "OVModelVaeDecoder"
+  ]
+}

scheduler/scheduler_config.json ADDED Viewed

	@@ -0,0 +1,14 @@

+{
+  "_class_name": "PNDMScheduler",
+  "_diffusers_version": "0.13.1",
+  "beta_end": 0.012,
+  "beta_schedule": "scaled_linear",
+  "beta_start": 0.00085,
+  "clip_sample": false,
+  "num_train_timesteps": 1000,
+  "prediction_type": "epsilon",
+  "set_alpha_to_one": false,
+  "skip_prk_steps": true,
+  "steps_offset": 1,
+  "trained_betas": null
+}

text_encoder/openvino_model.bin ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:8dedf0d36126919dd5b018997cbe0b3e8705166ebd06321114184242b73c98a4
+size 246121704

text_encoder/openvino_model.xml ADDED Viewed

The diff for this file is too large to render. See raw diff

tokenizer/merges.txt ADDED Viewed

The diff for this file is too large to render. See raw diff

tokenizer/special_tokens_map.json ADDED Viewed

	@@ -0,0 +1,24 @@

+{
+  "bos_token": {
+    "content": "<|startoftext|>",
+    "lstrip": false,
+    "normalized": true,
+    "rstrip": false,
+    "single_word": false
+  },
+  "eos_token": {
+    "content": "<|endoftext|>",
+    "lstrip": false,
+    "normalized": true,
+    "rstrip": false,
+    "single_word": false
+  },
+  "pad_token": "<|endoftext|>",
+  "unk_token": {
+    "content": "<|endoftext|>",
+    "lstrip": false,
+    "normalized": true,
+    "rstrip": false,
+    "single_word": false
+  }
+}

tokenizer/tokenizer_config.json ADDED Viewed

	@@ -0,0 +1,34 @@

+{
+  "add_prefix_space": false,
+  "bos_token": {
+    "__type": "AddedToken",
+    "content": "<|startoftext|>",
+    "lstrip": false,
+    "normalized": true,
+    "rstrip": false,
+    "single_word": false
+  },
+  "do_lower_case": true,
+  "eos_token": {
+    "__type": "AddedToken",
+    "content": "<|endoftext|>",
+    "lstrip": false,
+    "normalized": true,
+    "rstrip": false,
+    "single_word": false
+  },
+  "errors": "replace",
+  "model_max_length": 77,
+  "name_or_path": "models/CompVis-stable-diffusion-v1-4-ov/tokenizer",
+  "pad_token": "<|endoftext|>",
+  "special_tokens_map_file": "./special_tokens_map.json",
+  "tokenizer_class": "CLIPTokenizer",
+  "unk_token": {
+    "__type": "AddedToken",
+    "content": "<|endoftext|>",
+    "lstrip": false,
+    "normalized": true,
+    "rstrip": false,
+    "single_word": false
+  }
+}

tokenizer/vocab.json ADDED Viewed

The diff for this file is too large to render. See raw diff

unet/openvino_model.bin ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:4e9bf83e214896304b201ffd41794b1eb151fe36d47338660b3b0d2c3b463d10
+size 1719042636

unet/openvino_model.xml ADDED Viewed

The diff for this file is too large to render. See raw diff

vae_decoder/openvino_model.bin ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:bc523c30697d2cbf197960eeb4e449e1cfaa94abdbb4f3b9f5930c4a47ea9c3a
+size 98980700

vae_decoder/openvino_model.xml ADDED Viewed

The diff for this file is too large to render. See raw diff