--- license: apache-2.0 datasets: - lambdalabs/pokemon-blip-captions language: - en --- This is the highly optimized version of the [Stable Diffusion model for pokemon generation](https://huggingface.co/svjack/Stable-Diffusion-Pokemon-en). The model was optimized with a combination of two methods: * Quantization-aware training from [NNCF](https://github.com/openvinotoolkit/nncf). * A modification of the Token Merging method from [here](https://github.com/AlexKoff88/tomesd/tree/openvino). To run the model use the following code: ```python %pip install optimum[openvino,diffusers] from optimum.intel.openvino import OVStableDiffusionPipeline from diffusers import LMSDiscreteScheduler, DDPMScheduler import torch import random import numpy as np pipe = OVStableDiffusionPipeline.from_pretrained("OpenVINO/stable-diffusion-pokemons-tome-quantized-aggressive", compile=False) pipe.reshape(batch_size=1, height=512, width=512, num_images_per_prompt=1) pipe.compile() # Use original model to compare # pipe = OVStableDiffusionPipeline.from_pretrained("svjack/Stable-Diffusion-Pokemon-en", export=True, compile=False) prompt = "cartoon bird" output = pipe(prompt, num_inference_steps=50, output_type="pil") output.images[0].save("output.png") ```