File size: 4,660 Bytes
b659c17
452a20a
f45be97
 
b659c17
452a20a
 
 
 
 
 
 
b659c17
 
f45be97
 
 
 
 
b659c17
452a20a
b659c17
505bc2f
f45be97
 
 
 
 
 
b659c17
452a20a
b659c17
505bc2f
f45be97
 
 
 
 
a17b715
452a20a
a17b715
505bc2f
f45be97
 
 
 
a17b715
452a20a
a17b715
452a20a
f45be97
 
 
 
a17b715
452a20a
a17b715
505bc2f
f45be97
 
 
 
452a20a
 
 
 
b659c17
 
65ad8a2
b659c17
 
 
 
 
 
 
adeedcb
 
 
 
 
b659c17
 
 
1e730d0
 
b659c17
 
00ebd8e
b659c17
 
1e730d0
 
 
 
 
 
b659c17
 
 
 
 
68d54e2
 
1e730d0
b659c17
1e730d0
452a20a
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
---
base_model: PixArt-alpha/PixArt-Sigma-XL-2-1024-MS
library_name: diffusers
license: creativeml-openrail-m
tags:
- stable-diffusion
- stable-diffusion-diffusers
- text-to-image
- diffusers
- full
- pixart
- pixart sigma
inference: true
widget:
- text: A blonde sexy girl, wearing glasses at latex shirt and a blue beanie with
    a tattoo, blue and white, highly detailed, sublime, extremely beautiful, sharp
    focus, refined, cinematic, intricate, elegant, dynamic, rich deep colors, bright
    color, shining light, attractive, cute, pretty, background full, epic composition,
    dramatic atmosphere, radiant, professional, stunning
  parameters:
    negative_prompt: blurry, cropped, ugly
  output:
    url: ./assets/1.png
- text: a wizard with a glowing staff and a glowing hat, colorful magic, dramatic
    atmosphere, sharp focus, highly detailed, cinematic, original composition, fine
    detail, intricate, elegant, creative, color spread, shiny, amazing, symmetry,
    illuminated, inspired, pretty, attractive, artistic, dynamic background, relaxed,
    professional, extremely inspirational, beautiful, determined, cute, adorable,
    best
  parameters:
    negative_prompt: blurry, cropped, ugly
  output:
    url: ./assets/2.png
- text: girl in modern car, intricate, elegant, highly detailed, extremely complimentary
    colors, beautiful, glowing aesthetic, pretty, dramatic light, sharp focus, perfect
    composition, clear artistic color, calm professional background, precise, joyful,
    emotional, unique, cute, best, gorgeous, great delicate, expressive, thought,
    iconic, fine, awesome, creative, winning, charming, enhanced
  parameters:
    negative_prompt: blurry, cropped, ugly
  output:
    url: ./assets/3.png
- text: A girl stands amidst scattered glass shards, surrounded by a beautifully crafted
    and expansive world. The scene is depicted from a dynamic angle, emphasizing her
    determined expression. The background features vast landscapes with floating crystals
    and soft, glowing lights that create a mystical and grand atmosphere.
  parameters:
    negative_prompt: blurry, cropped, ugly
  output:
    url: ./assets/ComfyUI_PixArt_00040_.png
- text: A girl stands amidst scattered glass shards, surrounded by a beautifully crafted
    and expansive world. The scene is depicted from a dynamic angle, emphasizing her
    determined expression. The background features vast landscapes with floating crystals
    and soft, glowing lights that create a mystical and grand atmosphere.
  parameters:
    negative_prompt: blurry, cropped, ugly
  output:
    url: ./assets/ComfyUI_PixArt_00036_.png
- text: A close-up shot of a beautiful girl in a serene world. She has white hair
    and is blindfolded, with a calm expression. Her hands are pressed together in
    a prayer pose, with fingers interlaced and palms touching. The background is softly
    blurred, enhancing her ethereal presence.
  parameters:
    negative_prompt: blurry, cropped, ugly
  output:
    url: ./assets/ComfyUI_PixArt_00041_.png
---

# SigmaJourney: PixartSigma + MidJourney v6


<Gallery />


## Inference

### ComfyUI
- Download model file `transformer/diffusion_pytorch_model.safetensors` and put into `ComfyUI/models/checkpoints`
- Use ExtraModels node: https://github.com/city96/ComfyUI_ExtraModels?tab=readme-ov-file#pixart

![image/png](https://cdn-uploads.huggingface.co/production/uploads/643c7e91b409fef15e0bd11b/MJfTShin1fYOOCo4mTv2-.png)

```python
import torch
from diffusers import DiffusionPipeline, EulerAncestralDiscreteScheduler
from diffusers.models import PixArtTransformer2DModel


model_id = "TensorFamily/SigmaJourney"
negative_prompt = "malformed, disgusting, overexposed, washed-out"

pipeline = DiffusionPipeline.from_pretrained("PixArt-alpha/PixArt-Sigma-XL-2-1024-MS", torch_dtype=torch.float16)
pipeline.transformer = PixArtTransformer2DModel.from_pretrained(model_id, subfolder="transformer", torch_dtype=torch.float16)
pipeline.scheduler = EulerAncestralDiscreteScheduler.from_config(pipeline.scheduler.config)
pipeline.to('cuda' if torch.cuda.is_available() else 'cpu')

prompt = "On the left, there is a red cube. On the right, there is a blue sphere. On top of the red cube is a dog. On top of the blue sphere is a cat"
image = pipeline(
    prompt=prompt,
    negative_prompt='blurry, cropped, ugly',
    num_inference_steps=30,
    generator=torch.Generator(device='cuda' if torch.cuda.is_available() else 'mps' if torch.backends.mps.is_available() else 'cpu').manual_seed(1641421826),
    width=1024,
    height=1024,
    guidance_scale=5.5,
).images[0]
image.save("output.png", format="JPEG")
```