---
pipeline_tag: text-to-image
tags:
- art
- image-generation
- text-to-image
- diffusion
- high-quality
- AI
thumbnail: https://graph.org/file/66aad0f496f8a1390f222.jpg
language:
- en
---

![Kalpana Grid](./1f7207c48a5447d8af7bbd25be3ba46b_af3baa67856547f6aba7c4f9161d704c.jpg)

## Model Card for `enhanceaiteam/Kalpana`

`Kalpana` is a text-to-image generation model developed by EnhanceAI. It uses a 20-billion-parameter architecture to produce high-quality, detailed images from text descriptions. It is aimed at artists, designers, and other creative professionals who need visually compelling images that follow their prompts closely.

### Model Details

- **Version**: 1.0
- **Base Model**: FLUX and SDXL
- **Parameter Count**: 20 billion
- **Pipeline**: Flux Pipeline, Stable Diffusion 3

### Key Features

1. **High-Resolution Output**: Generates images at 1024x1024 resolution.
2. **Enhanced Prompt Understanding**: Improved comprehension of complex and detailed text prompts.
3. **Versatile Style Generation**: Capable of producing images in various artistic and realistic styles.
4. **High-Quality Images**: Delivers visually stunning and detailed images.
5. **Open Weights**: Available for scientific research and artistic projects.

### Usage

`Kalpana` can be loaded with the `diffusers` library. The example below generates a single image and saves it to disk:

```python
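import torch
from diffusers import FluxPipeline

# Load the weights in bfloat16 to reduce memory use.
pipe = FluxPipeline.from_pretrained("enhanceaiteam/kalpana", torch_dtype=torch.bfloat16)
# Move idle submodules to the CPU to save VRAM; remove this line if the GPU has enough memory.
pipe.enable_model_cpu_offload()

prompt = "A cat holding a sign that says hello world"
image = pipe(
    prompt=prompt,
    guidance_scale=0.0,
    height=768,
    width=1360,
    num_inference_steps=4,
    max_sequence_length=256,
).images[0]  # the pipeline returns a list of PIL images; take the first
image.save("image.png")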
```
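If you want to vary the resolution, it can help to keep the generation settings in one place. The following is a minimal sketch, not part of the official `Kalpana` tooling: the helper names `snap_to_multiple` and `generation_kwargs` are hypothetical, and the sketch assumes that `FluxPipeline` works best when `height` and `width` are multiples of 16. The default values simply mirror the example above.

```python
def snap_to_multiple(value: int, multiple: int = 16) -> int:
    """Round `value` down to the nearest multiple of `multiple`."""
    if value < multiple:
        raise ValueError(f"value must be at least {multiple}, got {value}")
    return (value // multiple) * multiple


def generation_kwargs(prompt: str, height: int = 768, width: int = 1360, steps: int = 4) -> dict:
    """Build the keyword arguments for a pipeline call (hypothetical helper).

    The fixed values (guidance_scale, max_sequence_length) are copied from
    the usage example; they are not tuned recommendations.
    """
    return {
        "prompt": prompt,
        "guidance_scale": 0.0,
        "height": snap_to_multiple(height),  # assumption: multiples of 16
        "width": snap_to_multiple(width),
        "num_inference_steps": steps,
        "max_sequence_length": 256,
    }
```

With a pipeline loaded as in the example above, the call would look like `pipe(**generation_kwargs("A cat holding a sign that says hello world")).images[0]`.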


### Prompts

When creating prompts, be as descriptive as possible to achieve the desired results. Detailed descriptions help the model generate more accurate and relevant images.
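One way to keep prompts descriptive is to assemble them from separate parts such as subject, style, and scene details. The sketch below is only an illustration: `build_prompt` is a hypothetical helper, and the comma-separated format is an assumption rather than a documented requirement of `Kalpana`.

```python
from typing import Iterable, Optional


def build_prompt(subject: str, style: Optional[str] = None, details: Iterable[str] = ()) -> str:
    """Join a subject, an optional style, and extra details into one prompt string."""
    parts = [subject, style, *details]
    # Drop missing or blank parts and trim stray whitespace before joining.
    cleaned = [part.strip() for part in parts if part and part.strip()]
    return ", ".join(cleaned)


prompt = build_prompt(
    "a red fox sitting on a mossy log",
    style="watercolor illustration",
    details=["snowy forest background", "soft morning light"],
)
print(prompt)
```

The resulting string can be passed as the `prompt` argument of the pipeline call shown in the Usage section.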

### Image Samples

Here are some example images generated by `Kalpana`:

- ![Example Image 1](https://graph.org/file/7720981460870d26a8918.jpg)
- ![Example Image 2](https://graph.org/file/669d85cecd5c2546aa9d8.jpg)
- ![Example Image 3](https://graph.org/file/545de8682a79eeca52a2e.jpg)
- ![Example Image 4](https://graph.org/file/4ce8fbb1e2659a9e8d941.jpg)
- ![Example Image 5](https://graph.org/file/808e757bbcdcb1b309f46.jpg)
- ![Example Image 6](https://graph.org/file/f2ad0dfbadf37ee965670.jpg)
- ![Example Image 7](https://graph.org/file/33ee184c789f4e807b392.jpg)
- ![Example Image 8](https://graph.org/file/2d7cca147719a2c70a6cf.jpg)
- ![Example Image 9](https://graph.org/file/0f56697b6edff47f5e240.jpg)

### Limitations

- This model is not intended or able to provide factual information.
- As a statistical model, this checkpoint might amplify existing societal biases.
- The model may fail to generate output that matches the prompts.
- Prompt following is heavily influenced by the prompting style.

### Out-of-Scope Use

The model and its derivatives may not be used:

- In any way that violates any applicable national, federal, state, local, or international law or regulation.
- For the purpose of exploiting, harming, or attempting to exploit or harm minors in any way; including but not limited to the solicitation, creation, acquisition, or dissemination of child exploitative content.
- To generate or disseminate verifiably false information and/or content with the purpose of harming others.
- To generate or disseminate personally identifiable information that can be used to harm an individual.
- To harass, abuse, threaten, stalk, or bully individuals or groups of individuals.
- To create non-consensual nudity or illegal pornographic content.
- For fully automated decision-making that adversely impacts an individual's legal rights or otherwise creates or modifies a binding, enforceable obligation.
- To generate or facilitate large-scale disinformation campaigns.

### License

This model is released under the [FLUX.1 [dev] Non-Commercial License](https://huggingface.co/black-forest-labs/FLUX.1-dev/blob/main/LICENSE.md).

### Contact

For support and inquiries, please reach out to [enhanceai.art](https://enhanceai.art).

---

Happy creating!

- The EnhanceAI Team
- Pranav Ajay