Create README.md
README.md
ADDED
@@ -0,0 +1,99 @@

---
pipeline_tag: text-to-image
---

![Kalpana Grid](./kalpana_grid.jpg)

## Model Card for `enhanceaiteam/Kalpana`

`Kalpana` is a text-to-image generation model developed by EnhanceAI. It uses a 20-billion-parameter architecture to produce detailed, high-quality images from text descriptions, and it is aimed at artists, designers, and other creative professionals.

### Model Details

- **Version**: 1.0
- **Base Model**: EnhanceAI's proprietary image generation architecture
- **Training Data**: Trained on a diverse dataset of high-resolution, copyright-free images, covering a broad range of visual styles and subjects.
- **Parameter Count**: 20 billion
- **Pipeline**: `FluxPipeline` from `diffusers` (the class loaded in the Usage example)
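
As a rough sanity check on hardware needs, the stated parameter count can be turned into a weight-memory estimate. The sketch below takes the 20 billion figure at face value and assumes 2 bytes per parameter for `bfloat16`, the dtype loaded in the Usage example. `weight_memory_gib` is a hypothetical helper, and the estimate ignores activations and any auxiliary components.

```python
# Rough weight-memory estimate only; activations and auxiliary
# components (text encoders, VAE) are not included.
BYTES_PER_PARAM = {"float32": 4, "bfloat16": 2, "float16": 2}


def weight_memory_gib(num_params: int, dtype: str = "bfloat16") -> float:
    """Return the approximate size of the weights in GiB for a given dtype."""
    return num_params * BYTES_PER_PARAM[dtype] / 1024**3


if __name__ == "__main__":
    params = 20_000_000_000  # parameter count stated in this model card
    for dtype in BYTES_PER_PARAM:
        print(f"{dtype}: {weight_memory_gib(params, dtype):.1f} GiB")
```

At 2 bytes per parameter the weights alone come to roughly 37 GiB, which is more than most single consumer GPUs provide.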

### Key Features

1. **High-Resolution Output**: Generates images at 1024x1024 resolution.
2. **Enhanced Prompt Understanding**: Improved comprehension of complex and detailed text prompts.
3. **Versatile Style Generation**: Capable of producing images in various artistic and realistic styles.
4. **High-Quality Images**: Delivers visually stunning and detailed images.
5. **Open Weights**: Available for scientific research and artistic projects.

### Usage

`Kalpana` can be loaded with the `diffusers` library. The following example generates an image from a text prompt and saves it to disk:

```python
import torch
from diffusers import FluxPipeline

# Load the weights in bfloat16 to halve memory use relative to float32.
pipe = FluxPipeline.from_pretrained("enhanceaiteam/kalpana", torch_dtype=torch.bfloat16)
# Keep the model components on the CPU and move each to the GPU only while it runs.
pipe.enable_model_cpu_offload()

prompt = "A cat holding a sign that says hello world"
out = pipe(
    prompt=prompt,
    guidance_scale=0.0,
    height=768,
    width=1360,
    num_inference_steps=4,
    max_sequence_length=256,
).images[0]  # the pipeline returns a list of images; take the first
out.save("image.png")
```
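
The `height` and `width` in the example above are both multiples of 16. Assuming the pipeline expects dimensions divisible by 16 (an assumption worth checking against the `diffusers` documentation for your installed version), a small helper can round a requested size down before it is passed to `pipe`. `snap_down` and `snap_size` are hypothetical names, not part of `diffusers`.

```python
def snap_down(value: int, multiple: int = 16) -> int:
    """Round value down to a multiple, but never below one multiple."""
    if value <= 0 or multiple <= 0:
        raise ValueError("value and multiple must be positive")
    return max(multiple, (value // multiple) * multiple)


def snap_size(width: int, height: int, multiple: int = 16) -> tuple[int, int]:
    """Return (width, height) with each side rounded down to the multiple."""
    return snap_down(width, multiple), snap_down(height, multiple)


if __name__ == "__main__":
    # The multiple of 16 is an assumption, not a documented requirement here.
    width, height = snap_size(1000, 700)
    print(width, height)
```

The returned values can then be passed as `width=width, height=height` in the call to `pipe`.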

### Prompts

Be as descriptive as possible when writing prompts. Detailed descriptions help the model generate more accurate and relevant images.
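
One way to keep prompts consistently detailed is to assemble them from a subject plus a list of descriptive details. The sketch below is illustrative only: `build_prompt` is a hypothetical helper, and the comma-separated layout is a common convention rather than something this model is known to require.

```python
def build_prompt(subject: str, *details: str) -> str:
    """Join a subject and optional descriptive details into one prompt string."""
    # Strip whitespace and skip details that are empty after stripping.
    parts = [subject.strip()] + [d.strip() for d in details if d.strip()]
    return ", ".join(parts)


if __name__ == "__main__":
    prompt = build_prompt(
        "A cat holding a sign that says hello world",
        "watercolor style",
        "soft morning light",
    )
    print(prompt)
```

The resulting string can be passed as the `prompt` argument in the Usage example.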

### Image Samples

Here are some example images generated by `Kalpana`:

- ![Example Image 1](https://enhanceai.s3.amazonaws.com/f735ed72-5c34-425e-8628-58527053ca81_3.png)
- ![Example Image 2](https://enhanceai.s3.amazonaws.com/066df381-0429-4445-a4a6-7e098718058e_1.png)
- ![Example Image 3](https://enhanceai.s3.amazonaws.com/bffeebfa-7bff-4377-8cd2-1705a6ef92a9_1.png)
- ![Example Image 4](https://enhanceai.s3.amazonaws.com/0a81bcd2-a640-45fa-8fba-4db282e3ac4a_1.png)
- ![Example Image 5](https://enhanceai.s3.amazonaws.com/915a1e2a-32ed-45a8-bccf-d4ad6abe81d9_1.png)
- ![Example Image 6](https://enhanceai.s3.amazonaws.com/e0723a97-a425-492a-8387-8e735084d718_1.png)

### Limitations

- This model is not intended or able to provide factual information.
- As a statistical model, this checkpoint might amplify existing societal biases.
- The model may fail to generate output that matches the prompts.
- Prompt following is heavily influenced by the prompting style.

### Out-of-Scope Use

The model and its derivatives may not be used:

- In any way that violates any applicable national, federal, state, local, or international law or regulation.
- For the purpose of exploiting, harming, or attempting to exploit or harm minors in any way, including but not limited to the solicitation, creation, acquisition, or dissemination of child exploitative content.
- To generate or disseminate verifiably false information and/or content with the purpose of harming others.
- To generate or disseminate personally identifiable information that can be used to harm an individual.
- To harass, abuse, threaten, stalk, or bully individuals or groups of individuals.
- To create non-consensual nudity or illegal pornographic content.
- For fully automated decision-making that adversely impacts an individual's legal rights or otherwise creates or modifies a binding, enforceable obligation.
- To generate or facilitate large-scale disinformation campaigns.

### License

This model falls under the [FLUX.1 [dev] Non-Commercial License](https://huggingface.co/black-forest-labs/FLUX.1-dev/blob/main/LICENSE.md).

### Contact

For support and inquiries, please reach out to [enhanceai.art](https://enhanceai.art).

---

Happy creating!

- The EnhanceAI Team
- Pranav Ajay