dataautogpt3 commited on
Commit
3f85ee3
1 Parent(s): 15d77c8

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +36 -21
README.md CHANGED
@@ -1,21 +1,36 @@
1
- ---
2
- license: cc-by-nc-4.0
3
- pipeline:
4
- ---
5
- sd 1.5 fine-tuned on 131000 high-quality captioned image pairs generated from dalle3 on 4 3090s with nvlink for 16hrs for 8 epochs.
6
-
7
- it seems to be good at people, hands, and text but not animals.
8
-
9
- unique examples: 13100
10
-
11
- num examples: 131000
12
-
13
- num epochs: 8
14
-
15
- num examples: 31000
16
-
17
- total train batch size: 40
18
-
19
- gradient accumulation = 1
20
-
21
- total optimization steps: 26200
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ license: cc-by-nc-nd-4.0
2
+ pipeline_tag: text-to-image
3
+ description: >
4
+ This model is a fine-tuned version of Stable Diffusion 1.5, specifically enhanced for generating high-quality images of people, hands, and text. It has been trained on 131,000 high-quality, captioned image pairs generated using DALL-E 3. The training was conducted on four NVIDIA 3090 GPUs with NVLink over 16 hours, spanning 8 epochs.
5
+
6
+ The model demonstrates notable proficiency in rendering human figures and intricate details like hand gestures and written text, although it shows less effectiveness with animal imagery. This specialization makes it well-suited for applications requiring precise human and text representations.
7
+
8
+ The fine-tuning process involved 13,100 unique examples, contributing to a total dataset size of 131,000 images. Each training epoch processed 31,000 examples, with a total train batch size of 40. The model underwent a total of 26,200 optimization steps, maintaining a gradient accumulation of 1 throughout the training period.
9
+
10
+ The enhancements in this version aim to minimize common image generation flaws such as blurriness, disproportion, noise, and low resolution, ensuring clear and anatomically accurate outputs.
11
+
12
+ widget:
13
+ - text: '-'
14
+ output:
15
+ url: ComfyUI_00641_.png
16
+ - text: '-'
17
+ output:
18
+ url: ComfyUI_00637_.png
19
+ - text: '-'
20
+ output:
21
+ url: ComfyUI_00623_.png
22
+ - text: '-'
23
+ output:
24
+ url: ComfyUI_00617_.png
25
+ - text: '-'
26
+ output:
27
+ url: ComfyUI_00615_.png
28
+ - text: '-'
29
+ parameters:
30
+ negative_prompt: >
31
+ bad quality, bad anatomy, worst quality, low quality, low resolution,
32
+ extra fingers, blur, blurry, ugly, wrong proportions, watermark, image
33
+ artifacts, lowres, ugly, jpeg artifacts, deformed, noisy image,
34
+ embedding:ac_neg1,
35
+ output:
36
+ url: ComfyUI_00614_.png