|
--- |
|
license: creativeml-openrail-m |
|
tags: |
|
- text-to-image |
|
- stable-diffusion |
|
language: |
|
- en |
|
library_name: diffusers |
|
pipeline_tag: text-to-image |
|
--- |
|
|
|
# Model Card: Stable-Cats-Generator |
|
|
|
## Model Information |
|
|
|
- **Model Name:** Stable-Cats-Generator |
|
- **Model Version:** v1 |
|
- **Model Type:** Image Generation |
|
- **Based on:** Stable Diffusion v2 |
|
|
|
![image/png](https://cdn-uploads.huggingface.co/production/uploads/6338c06c107c4835a05699f9/0b8P7pCT91aaflI_8s5UK.png) |
|
|
|
## Model Description |
|
|
|
Stable-Cats-Generator is an image generation model fine-tuned for generating white cat images based on text prompts. |
|
It is built upon **Stable Diffusion v2** and utilizes a pretrained text encoder (OpenCLIP-ViT/H) for text-to-image generation. |
|
**Stable Diffusion v2** is the latest version of the Stable Diffusion text-to-image diffusion model. |
|
It was released in 2023 and is based on the same core principles as the original Stable Diffusion model, but it has a number of improvements |
|
|
|
|
|
## Intended Use |
|
|
|
- Safe content generation |
|
- Artistic and creative processes |
|
- Bias and limitation exploration |
|
- Educational and creative tools |
|
|
|
## Potential Use Cases |
|
|
|
- Generating cat images for artistic purposes |
|
- Investigating biases and limitations of generative models |
|
- Creating safe and customizable content |
|
- Enhancing educational or creative tools |
|
|
|
## Model Capabilities |
|
|
|
- High-quality white cat image generation |
|
- Quick image generation, even on single GPUs |
|
- Customizable for specific needs and datasets |
|
|
|
## Limitations |
|
|
|
- May not always produce realistic images |
|
- Limited to generating white cat images based on text prompts |
|
- Ethical considerations when using generated content |
|
|
|
## Ethical Considerations |
|
|
|
- Ensure generated content is safe and non-harmful |
|
- Monitor and mitigate potential biases in generated content |
|
- Respect copyright and licensing when using generated images |
|
|
|
|
|
## Responsible AI |
|
|
|
- Ongoing monitoring and evaluation of model outputs |
|
- Regular updates to address limitations and improve safety |
|
- Compliance with ethical guidelines and legal regulations |
|
|
|
|
|
## Disclaimer |
|
|
|
This model card serves as a documentation tool and does not constitute legal or ethical guidance. Users of the model are responsible for adhering to ethical and legal standards in their use of the model. |
|
|
|
## Usage |
|
|
|
``` |
|
pip install diffusers==0.11.1 |
|
pip install transformers scipy ftfy accelerate |
|
``` |
|
```python |
|
import torch |
|
from diffusers import StableDiffusionPipeline |
|
|
|
pipe = StableDiffusionPipeline.from_pretrained("ayoubkirouane/Stable-Cats-Generator", torch_dtype=torch.float16) |
|
pipe = pipe.to("cuda") |
|
|
|
prompt = "A photo of a picture-perfect white cat." |
|
image = pipe(prompt).images[0] # image here is in [PIL format](https://pillow.readthedocs.io/en/stable/) |
|
|
|
# Now to display an image you can either save it such as: |
|
image.save(f"cat.png") |
|
|
|
# or if you're in a google colab you can directly display it with |
|
image |
|
|
|
``` |
|
|