bghira commited on
Commit
496cdcf
·
verified ·
1 Parent(s): 9232ccf

Model card auto-generated by SimpleTuner

Browse files
Files changed (1) hide show
  1. README.md +305 -0
README.md ADDED
@@ -0,0 +1,305 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: other
3
+ base_model: "black-forest-labs/FLUX.1-dev"
4
+ tags:
5
+ - flux
6
+ - flux-diffusers
7
+ - text-to-image
8
+ - diffusers
9
+ - simpletuner
10
+ - not-for-all-audiences
11
+ - lora
12
+ - template:sd-lora
13
+ - lycoris
14
+ inference: true
15
+ widget:
16
+ - text: 'unconditional (blank prompt)'
17
+ parameters:
18
+ negative_prompt: 'blurry, cropped, ugly'
19
+ output:
20
+ url: ./assets/image_0_0.png
21
+ - text: 'unconditional (blank prompt)'
22
+ parameters:
23
+ negative_prompt: 'blurry, cropped, ugly'
24
+ output:
25
+ url: ./assets/image_1_1.png
26
+ - text: 'a photo of cheech marin'
27
+ parameters:
28
+ negative_prompt: 'blurry, cropped, ugly'
29
+ output:
30
+ url: ./assets/image_2_0.png
31
+ - text: 'a photo of cheech marin'
32
+ parameters:
33
+ negative_prompt: 'blurry, cropped, ugly'
34
+ output:
35
+ url: ./assets/image_3_1.png
36
+ - text: 'a photo of tommy chong'
37
+ parameters:
38
+ negative_prompt: 'blurry, cropped, ugly'
39
+ output:
40
+ url: ./assets/image_4_0.png
41
+ - text: 'a photo of tommy chong'
42
+ parameters:
43
+ negative_prompt: 'blurry, cropped, ugly'
44
+ output:
45
+ url: ./assets/image_5_1.png
46
+ - text: 'a photo with tommy chong sitting to the left of cheech marin'
47
+ parameters:
48
+ negative_prompt: 'blurry, cropped, ugly'
49
+ output:
50
+ url: ./assets/image_6_0.png
51
+ - text: 'a photo with tommy chong sitting to the left of cheech marin'
52
+ parameters:
53
+ negative_prompt: 'blurry, cropped, ugly'
54
+ output:
55
+ url: ./assets/image_7_1.png
56
+ - text: 'a photo with cheech marin sitting to the right of tommy chong'
57
+ parameters:
58
+ negative_prompt: 'blurry, cropped, ugly'
59
+ output:
60
+ url: ./assets/image_8_0.png
61
+ - text: 'a photo with cheech marin sitting to the right of tommy chong'
62
+ parameters:
63
+ negative_prompt: 'blurry, cropped, ugly'
64
+ output:
65
+ url: ./assets/image_9_1.png
66
+ - text: 'cheech and chong together in a photograph'
67
+ parameters:
68
+ negative_prompt: 'blurry, cropped, ugly'
69
+ output:
70
+ url: ./assets/image_10_0.png
71
+ - text: 'cheech and chong together in a photograph'
72
+ parameters:
73
+ negative_prompt: 'blurry, cropped, ugly'
74
+ output:
75
+ url: ./assets/image_11_1.png
76
+ - text: 'young cheech and chong in a black and white photograph'
77
+ parameters:
78
+ negative_prompt: 'blurry, cropped, ugly'
79
+ output:
80
+ url: ./assets/image_12_0.png
81
+ - text: 'young cheech and chong in a black and white photograph'
82
+ parameters:
83
+ negative_prompt: 'blurry, cropped, ugly'
84
+ output:
85
+ url: ./assets/image_13_1.png
86
+ - text: 'elderly cheech and chong in an interview on the BBC'
87
+ parameters:
88
+ negative_prompt: 'blurry, cropped, ugly'
89
+ output:
90
+ url: ./assets/image_14_0.png
91
+ - text: 'elderly cheech and chong in an interview on the BBC'
92
+ parameters:
93
+ negative_prompt: 'blurry, cropped, ugly'
94
+ output:
95
+ url: ./assets/image_15_1.png
96
+ - text: 'old tommy chong on a sitcom in the 1990s'
97
+ parameters:
98
+ negative_prompt: 'blurry, cropped, ugly'
99
+ output:
100
+ url: ./assets/image_16_0.png
101
+ - text: 'old tommy chong on a sitcom in the 1990s'
102
+ parameters:
103
+ negative_prompt: 'blurry, cropped, ugly'
104
+ output:
105
+ url: ./assets/image_17_1.png
106
+ - text: 'anime cheech marin'
107
+ parameters:
108
+ negative_prompt: 'blurry, cropped, ugly'
109
+ output:
110
+ url: ./assets/image_18_0.png
111
+ - text: 'anime cheech marin'
112
+ parameters:
113
+ negative_prompt: 'blurry, cropped, ugly'
114
+ output:
115
+ url: ./assets/image_19_1.png
116
+ - text: 'anime tommy chong'
117
+ parameters:
118
+ negative_prompt: 'blurry, cropped, ugly'
119
+ output:
120
+ url: ./assets/image_20_0.png
121
+ - text: 'anime tommy chong'
122
+ parameters:
123
+ negative_prompt: 'blurry, cropped, ugly'
124
+ output:
125
+ url: ./assets/image_21_1.png
126
+ - text: 'A photo-realistic image of a tommy chong'
127
+ parameters:
128
+ negative_prompt: 'blurry, cropped, ugly'
129
+ output:
130
+ url: ./assets/image_22_0.png
131
+ - text: 'A photo-realistic image of a tommy chong'
132
+ parameters:
133
+ negative_prompt: 'blurry, cropped, ugly'
134
+ output:
135
+ url: ./assets/image_23_1.png
136
+ ---
137
+
138
+ # Flux.1-dev-LoKr-test1.4-nomask
139
+
140
+ This is a LyCORIS adapter derived from [black-forest-labs/FLUX.1-dev](https://huggingface.co/black-forest-labs/FLUX.1-dev).
141
+
142
+
143
+ The main validation prompt used during training was:
144
+
145
+
146
+
147
+ ```
148
+ A photo-realistic image of a tommy chong
149
+ ```
150
+
151
+ ## Validation settings
152
+ - CFG: `3.0`
153
+ - CFG Rescale: `0.0`
154
+ - Steps: `20`
155
+ - Sampler: `None`
156
+ - Seed: `42`
157
+ - Resolutions: `1024x1024,1280x768`
158
+
159
+ Note: The validation settings are not necessarily the same as the [training settings](#training-settings).
160
+
161
+ You can find some example images in the following gallery:
162
+
163
+
164
+ <Gallery />
165
+
166
+ The text encoder **was not** trained.
167
+ You may reuse the base model text encoder for inference.
168
+
169
+
170
+ ## Training settings
171
+
172
+ - Training epochs: 0
173
+ - Training steps: 100
174
+ - Learning rate: 0.001
175
+ - Effective batch size: 6
176
+ - Micro-batch size: 2
177
+ - Gradient accumulation steps: 1
178
+ - Number of GPUs: 3
179
+ - Prediction type: flow-matching
180
+ - Rescaled betas zero SNR: False
181
+ - Optimizer: optimi-stableadamwweight_decay=1e-3
182
+ - Precision: Pure BF16
183
+ - Quantised: Yes: int8-quanto
184
+ - Xformers: Not used
185
+ - LyCORIS Config:
186
+ ```json
187
+ {
188
+ "algo": "lokr",
189
+ "multiplier": 1.0,
190
+ "linear_dim": 10000,
191
+ "linear_alpha": 1,
192
+ "factor": 12,
193
+ "apply_preset": {
194
+ "target_module": [
195
+ "Attention",
196
+ "FeedForward"
197
+ ],
198
+ "module_algo_map": {
199
+ "Attention": {
200
+ "factor": 12
201
+ },
202
+ "FeedForward": {
203
+ "factor": 6
204
+ }
205
+ }
206
+ }
207
+ }
208
+ ```
209
+
210
+ ## Datasets
211
+
212
+ ### cheechandchong-512
213
+ - Repeats: 100
214
+ - Total number of images: ~24
215
+ - Total number of aspect buckets: 5
216
+ - Resolution: 0.262144 megapixels
217
+ - Cropped: False
218
+ - Crop style: None
219
+ - Crop aspect: None
220
+ ### cheechandchong-1024
221
+ - Repeats: 100
222
+ - Total number of images: ~30
223
+ - Total number of aspect buckets: 8
224
+ - Resolution: 1.048576 megapixels
225
+ - Cropped: False
226
+ - Crop style: None
227
+ - Crop aspect: None
228
+ ### cheechandchong-512-crop
229
+ - Repeats: 100
230
+ - Total number of images: ~18
231
+ - Total number of aspect buckets: 1
232
+ - Resolution: 0.262144 megapixels
233
+ - Cropped: True
234
+ - Crop style: random
235
+ - Crop aspect: square
236
+ ### cheechandchong-1024-crop
237
+ - Repeats: 100
238
+ - Total number of images: ~18
239
+ - Total number of aspect buckets: 1
240
+ - Resolution: 1.048576 megapixels
241
+ - Cropped: True
242
+ - Crop style: random
243
+ - Crop aspect: square
244
+ ### regularisation-512
245
+ - Repeats: 0
246
+ - Total number of images: ~5886
247
+ - Total number of aspect buckets: 10
248
+ - Resolution: 0.262144 megapixels
249
+ - Cropped: False
250
+ - Crop style: None
251
+ - Crop aspect: None
252
+ ### regularisation-1024
253
+ - Repeats: 0
254
+ - Total number of images: ~5892
255
+ - Total number of aspect buckets: 20
256
+ - Resolution: 1.048576 megapixels
257
+ - Cropped: False
258
+ - Crop style: None
259
+ - Crop aspect: None
260
+ ### regularisation-512-crop
261
+ - Repeats: 0
262
+ - Total number of images: ~5874
263
+ - Total number of aspect buckets: 1
264
+ - Resolution: 0.262144 megapixels
265
+ - Cropped: True
266
+ - Crop style: random
267
+ - Crop aspect: square
268
+ ### regularisation-1024-crop
269
+ - Repeats: 0
270
+ - Total number of images: ~5874
271
+ - Total number of aspect buckets: 1
272
+ - Resolution: 1.048576 megapixels
273
+ - Cropped: True
274
+ - Crop style: random
275
+ - Crop aspect: square
276
+
277
+
278
+ ## Inference
279
+
280
+
281
+ ```python
282
+ import torch
283
+ from diffusers import DiffusionPipeline
284
+ from lycoris import create_lycoris_from_weights
285
+
286
+ model_id = 'black-forest-labs/FLUX.1-dev'
287
+ adapter_id = 'pytorch_lora_weights.safetensors' # you will have to download this manually
288
+ lora_scale = 1.0
289
+ wrapper, _ = create_lycoris_from_weights(lora_scale, adapter_id, pipeline.transformer)
290
+ wrapper.merge_to()
291
+
292
+ prompt = "A photo-realistic image of a tommy chong"
293
+
294
+ pipeline.to('cuda' if torch.cuda.is_available() else 'mps' if torch.backends.mps.is_available() else 'cpu')
295
+ image = pipeline(
296
+ prompt=prompt,
297
+ num_inference_steps=20,
298
+ generator=torch.Generator(device='cuda' if torch.cuda.is_available() else 'mps' if torch.backends.mps.is_available() else 'cpu').manual_seed(1641421826),
299
+ width=1024,
300
+ height=1024,
301
+ guidance_scale=3.0,
302
+ ).images[0]
303
+ image.save("output.png", format="PNG")
304
+ ```
305
+