|
--- |
|
pipeline_tag: text-to-image |
|
tags: |
|
- art |
|
- image-generation |
|
- text-to-image |
|
- diffusion |
|
- high-quality |
|
- AI |
|
thumbnail: https://graph.org/file/66aad0f496f8a1390f222.jpg |
|
--- |
|
|
|
![Kalpana Grid](./1f7207c48a5447d8af7bbd25be3ba46b_af3baa67856547f6aba7c4f9161d704c.jpg) |
|
|
|
## Model Card for `enhanceaiteam/Kalpana` |
|
|
|
`Kalpana` is a powerful text-to-image generation model developed by EnhanceAI. It uses a 20 billion parameter architecture to produce high-quality, detailed images based on textual descriptions. This model is ideal for artists, designers, and creative professionals seeking to generate visually compelling and accurate images. |
|
|
|
### Model Details |
|
|
|
- **Version**: 1.0 |
|
- **Base Model**: Flux and Sdxl |
|
- **Parameter Count**: 20 billion |
|
- **Pipeline**: Flux Pipeline, Stable diffusion 3 |
|
|
|
### Key Features |
|
|
|
1. **High-Resolution Output**: Generates images at 1024x1024 resolution. |
|
2. **Enhanced Prompt Understanding**: Improved comprehension of complex and detailed text prompts. |
|
3. **Versatile Style Generation**: Capable of producing images in various artistic and realistic styles. |
|
4. **High-Quality Images**: Delivers visually stunning and detailed images. |
|
5. **Open Weights**: Available for scientific research and artistic projects. |
|
|
|
### Usage |
|
|
|
To use `Kalpana`, you can integrate it with the `diffusers` library. Below is an example of how to generate images: |
|
|
|
```python |
|
import torch |
|
from diffusers import FluxPipeline |
|
|
|
pipe = FluxPipeline.from_pretrained("enhanceaiteam/kalpana", torch_dtype=torch.bfloat16) |
|
pipe.enable_model_cpu_offload() |
|
|
|
prompt = "A cat holding a sign that says hello world" |
|
out = pipe( |
|
prompt=prompt, |
|
guidance_scale=0., |
|
height=768, |
|
width=1360, |
|
num_inference_steps=4, |
|
max_sequence_length=256, |
|
).images[0] |
|
out.save("image.png") |
|
``` |
|
|
|
|
|
### Prompts |
|
|
|
When creating prompts, be as descriptive as possible to achieve the desired results. Detailed descriptions help the model generate more accurate and relevant images. |
|
|
|
### Image Samples |
|
|
|
Here are some example images generated by `Kalpana`: |
|
|
|
- ![Example Image 1](https://graph.org/file/7720981460870d26a8918.jpg) |
|
- ![Example Image 2](https://graph.org/file/669d85cecd5c2546aa9d8.jpg) |
|
- ![Example Image 3](https://graph.org/file/545de8682a79eeca52a2e.jpg) |
|
- ![Example Image 4](https://graph.org/file/4ce8fbb1e2659a9e8d941.jpg) |
|
- ![Example Image 5](https://graph.org/file/808e757bbcdcb1b309f46.jpg) |
|
- ![Example Image 6](https://graph.org/file/f2ad0dfbadf37ee965670.jpg) |
|
- ![Example Image 7](https://graph.org/file/33ee184c789f4e807b392.jpg) |
|
- ![Example Image 8](https://graph.org/file/2d7cca147719a2c70a6cf.jpg) |
|
- ![Example Image 9](https://graph.org/file/0f56697b6edff47f5e240.jpg) |
|
|
|
### Limitations |
|
|
|
- This model is not intended or able to provide factual information. |
|
- As a statistical model, this checkpoint might amplify existing societal biases. |
|
- The model may fail to generate output that matches the prompts. |
|
- Prompt following is heavily influenced by the prompting style. |
|
|
|
### Out-of-Scope Use |
|
|
|
The model and its derivatives may not be used: |
|
|
|
- In any way that violates any applicable national, federal, state, local, or international law or regulation. |
|
- For the purpose of exploiting, harming, or attempting to exploit or harm minors in any way; including but not limited to the solicitation, creation, acquisition, or dissemination of child exploitative content. |
|
- To generate or disseminate verifiably false information and/or content with the purpose of harming others. |
|
- To generate or disseminate personally identifiable information that can be used to harm an individual. |
|
- To harass, abuse, threaten, stalk, or bully individuals or groups of individuals. |
|
- To create non-consensual nudity or illegal pornographic content. |
|
- For fully automated decision-making that adversely impacts an individual's legal rights or otherwise creates or modifies a binding, enforceable obligation. |
|
- Generating or facilitating large-scale disinformation campaigns. |
|
|
|
### License |
|
|
|
This model falls under the [FLUX.1 [dev] Non-Commercial License](https://huggingface.co/black-forest-labs/FLUX.1-dev/blob/main/LICENSE.md). |
|
|
|
### Contact |
|
|
|
For support and inquiries, please reach out to [enhanceai.art](https://enhanceai.art). |
|
|
|
--- |
|
|
|
Happy creating! |
|
|
|
- The EnhanceAI Team |
|
- Pranav Ajay |