leditsplusplus

Running on A10G

App Files Files Community

binhle0209 commited on Mar 6

Commit

d101b98

•

1 Parent(s): 4ff5549

Edit attention map selection upon num_edit_tokens

Browse files

In line 932-935, selecting attention map range should be upon the `num_edit_tokens[batch_size*c]`, as the `num_edit_tokens` having each concept repeated `batch_size` times.

Files changed (1) hide show

pipeline_semantic_stable_diffusion_img2img_solver.py +3 -3

pipeline_semantic_stable_diffusion_img2img_solver.py CHANGED Viewed

@@ -928,11 +928,11 @@ class SemanticStableDiffusionImg2ImgPipeline_DPMSolver(DiffusionPipeline):
                                 from_where=["up", "down"],
                                 is_cross=True,
                                 select=text_cross_attention_maps.index(editing_prompt[c]),
-                            )
-                            attn_map = out[:, :, :, 1:1 + num_edit_tokens[c]]  # 0 -> startoftext
                             # average over all tokens
-                            assert (attn_map.shape[3] == num_edit_tokens[c])
                             attn_map = torch.sum(attn_map, dim=3)
                             # gaussian_smoothing

                                 from_where=["up", "down"],
                                 is_cross=True,
                                 select=text_cross_attention_maps.index(editing_prompt[c]),
+                            )
+                            attn_map = out[:, :, :, 1:1 + num_edit_tokens[self.batch_size*c]]  # 0 -> startoftext
                             # average over all tokens
+                            assert (attn_map.shape[3] == num_edit_tokens[self.batch_size*c])
                             attn_map = torch.sum(attn_map, dim=3)
                             # gaussian_smoothing