pranavajay committed on
Commit
8e1408a
1 Parent(s): 50a1482

Create README.md

  1. README.md +99 -0
README.md ADDED
@@ -0,0 +1,99 @@
+ ---
+ pipeline_tag: text-to-image
+ ---
+
+ ![Kalpana Grid](./kalpana_grid.jpg)
+
+ ## Model Card for `enhanceaiteam/Kalpana`
+
+ `Kalpana` is a text-to-image generation model developed by EnhanceAI. It uses a 20-billion-parameter architecture to generate detailed images from text descriptions. It is aimed at artists, designers, and other creative professionals.
+
+ ### Model Details
+
+ - **Version**: 1.0
+ - **Base Model**: EnhanceAI's proprietary image generation architecture
+ - **Training Data**: Trained on a diverse dataset of high-resolution, copyright-free images, covering a broad range of visual styles and subjects.
+ - **Parameter Count**: 20 billion
+ - **Pipeline**: `FluxPipeline` from `diffusers`, as used in the Usage example
+
+ ### Key Features
+
+ 1. **High-Resolution Output**: Generates images at 1024x1024 resolution.
+ 2. **Enhanced Prompt Understanding**: Improved comprehension of complex and detailed text prompts.
+ 3. **Versatile Style Generation**: Capable of producing images in various artistic and realistic styles.
+ 4. **High-Quality Images**: Delivers visually stunning and detailed images.
+ 5. **Open Weights**: Available for scientific research and artistic projects.
+
+ ### Usage
+
+ `Kalpana` can be loaded with the `diffusers` library. The example below generates a single image and saves it to `image.png`:
+
+ ```python
+ import torch
+ from diffusers import FluxPipeline
+
+ # Load the weights in bfloat16 to reduce memory use.
+ pipe = FluxPipeline.from_pretrained("enhanceaiteam/Kalpana", torch_dtype=torch.bfloat16)
+ # Keep submodels on the CPU and move each to the GPU only while it runs.
+ pipe.enable_model_cpu_offload()
+
+ prompt = "A cat holding a sign that says hello world"
+ image = pipe(
+     prompt=prompt,
+     guidance_scale=0.0,
+     height=768,
+     width=1360,
+     num_inference_steps=4,
+     max_sequence_length=256,
+ ).images[0]  # the pipeline returns a list of PIL images; take the first
+ image.save("image.png")
+ ```
+
+
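The `height` and `width` values in the example above (768 and 1360) are both multiples of 16. As a minimal sketch, assuming the pipeline expects dimensions divisible by 16 (an assumption, not something this model card states), a hypothetical helper `snap_size` can round a requested size down before it is passed to `pipe(...)`:

```python
def snap_size(width: int, height: int, multiple: int = 16) -> tuple[int, int]:
    """Round each dimension down to the nearest multiple of `multiple`.

    `snap_size` and the default of 16 are illustrative assumptions,
    not part of the Kalpana or diffusers API.
    """
    if width <= 0 or height <= 0 or multiple <= 0:
        raise ValueError("width, height, and multiple must be positive")
    # Never return less than one full multiple per dimension.
    snapped_width = max(multiple, (width // multiple) * multiple)
    snapped_height = max(multiple, (height // multiple) * multiple)
    return snapped_width, snapped_height
```

The result could then be unpacked into the call, for example `width, height = snap_size(1000, 700)` followed by `pipe(prompt=prompt, width=width, height=height, ...)`.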
+ ### Prompts
+
+ Write prompts that are as descriptive as possible. Detailed descriptions help the model generate images that match the intended result more closely.
+
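One way to keep prompts descriptive is to assemble them from a subject plus a list of details. The sketch below is illustrative only: `build_prompt` is a hypothetical helper, not part of `Kalpana` or `diffusers`, and the comma-separated layout is an assumption about what works well rather than a documented requirement.

```python
def build_prompt(subject: str, *details: str) -> str:
    """Join a subject and optional detail phrases into one comma-separated prompt.

    Hypothetical helper for illustration; empty or whitespace-only
    phrases are skipped.
    """
    parts = [subject.strip()] + [detail.strip() for detail in details]
    return ", ".join(part for part in parts if part)


prompt = build_prompt(
    "A cat holding a sign that says hello world",
    "watercolor style",
    "soft morning light",
)
```

The resulting string can be passed as the `prompt` argument in the usage example.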
+ ### Image Samples
+
+ Here are some example images generated by `Kalpana`:
+
+ - ![Example Image 1](https://enhanceai.s3.amazonaws.com/f735ed72-5c34-425e-8628-58527053ca81_3.png)
+ - ![Example Image 2](https://enhanceai.s3.amazonaws.com/066df381-0429-4445-a4a6-7e098718058e_1.png)
+ - ![Example Image 3](https://enhanceai.s3.amazonaws.com/bffeebfa-7bff-4377-8cd2-1705a6ef92a9_1.png)
+ - ![Example Image 4](https://enhanceai.s3.amazonaws.com/0a81bcd2-a640-45fa-8fba-4db282e3ac4a_1.png)
+ - ![Example Image 5](https://enhanceai.s3.amazonaws.com/915a1e2a-32ed-45a8-bccf-d4ad6abe81d9_1.png)
+ - ![Example Image 6](https://enhanceai.s3.amazonaws.com/e0723a97-a425-492a-8387-8e735084d718_1.png)
+
+ ### Limitations
+
+ - This model is not intended or able to provide factual information.
+ - As a statistical model, this checkpoint might amplify existing societal biases.
+ - The model may fail to generate output that matches the prompts.
+ - Prompt following is heavily influenced by the prompting style.
+
+ ### Out-of-Scope Use
+
+ The model and its derivatives may not be used:
+
+ - In any way that violates any applicable national, federal, state, local, or international law or regulation.
+ - For the purpose of exploiting, harming, or attempting to exploit or harm minors in any way, including but not limited to the solicitation, creation, acquisition, or dissemination of child exploitative content.
+ - To generate or disseminate verifiably false information and/or content with the purpose of harming others.
+ - To generate or disseminate personally identifiable information that can be used to harm an individual.
+ - To harass, abuse, threaten, stalk, or bully individuals or groups of individuals.
+ - To create non-consensual nudity or illegal pornographic content.
+ - For fully automated decision-making that adversely impacts an individual's legal rights or otherwise creates or modifies a binding, enforceable obligation.
+ - To generate or facilitate large-scale disinformation campaigns.
+
+ ### License
+
+ This model falls under the [FLUX.1 [dev] Non-Commercial License](https://huggingface.co/black-forest-labs/FLUX.1-dev/blob/main/LICENSE.md).
+
+ ### Contact
+
+ For support and inquiries, please reach out to [enhanceai.art](https://enhanceai.art).
+
+ ---
+
+ Happy creating!
+
+ - The EnhanceAI Team
+ - Pranav Ajay