Commit 5311988, committed by PseudoTerminal X
1 parent: c6d3a34
Trained for 0 epochs and 5100 steps.
Trained with datasets ['text-embeds-sdxl', 'photocb-clip-embeds', 'celebrities', 'movieposters', 'normalnudes', 'propagandaposters', 'guys', 'pixel-art', 'signs', 'moviecollection', 'bookcovers', 'nijijourney', 'experimental', 'ethnic', 'sports', 'gay', 'architecture', 'shutterstock', 'cinemamix-1mp', 'nsfw-1024', 'anatomy', 'bg20k-1024', 'yoga', 'photo-aesthetics', 'text-1mp', 'photo-concept-bucket']
Learning rate 4e-07, batch size 8, and 4 gradient accumulation steps.
Used DDPM noise scheduler for training with v_prediction prediction type and rescaled_betas_zero_snr=True
Using 'trailing' timestep spacing.
Base model: ptx0/terminus-xl-velocity-v1
VAE: madebyollin/sdxl-vae-fp16-fix
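The commit message mentions zero-terminal-SNR beta rescaling and 'trailing' timestep spacing. The following is a minimal pure-Python sketch of what those two settings usually mean, not the trainer's actual code. The function names are hypothetical, and the scaled-linear schedule with `start=0.00085`, `end=0.012` is an assumption (typical SDXL-style values), since the commit does not state the beta schedule.

```python
import math

def scaled_linear_betas(n=1000, start=0.00085, end=0.012):
    # Assumed schedule (not stated in the commit): linear in sqrt(beta).
    lo, hi = math.sqrt(start), math.sqrt(end)
    return [(lo + (hi - lo) * i / (n - 1)) ** 2 for i in range(n)]

def rescale_zero_terminal_snr(betas):
    # Build sqrt(alpha_bar_t), the signal scale at each timestep.
    alphas_bar_sqrt, prod = [], 1.0
    for b in betas:
        prod *= 1.0 - b
        alphas_bar_sqrt.append(math.sqrt(prod))
    first, last = alphas_bar_sqrt[0], alphas_bar_sqrt[-1]
    # Shift so the last value is exactly 0, then scale so the first is unchanged.
    scale = first / (first - last)
    shifted = [(a - last) * scale for a in alphas_bar_sqrt]
    alphas_bar = [a * a for a in shifted]
    # Recover per-step alphas from the cumulative product, then betas.
    alphas = [alphas_bar[0]] + [
        alphas_bar[i] / alphas_bar[i - 1] for i in range(1, len(alphas_bar))
    ]
    return [1.0 - a for a in alphas]

def trailing_timesteps(num_inference_steps, num_train_timesteps=1000):
    # 'trailing' spacing counts down from the final training timestep.
    step = num_train_timesteps / num_inference_steps
    return [
        round(num_train_timesteps - i * step) - 1
        for i in range(num_inference_steps)
    ]

betas = rescale_zero_terminal_snr(scaled_linear_betas())
timesteps = trailing_timesteps(4)
```

After rescaling, the final beta is 1, so the last timestep carries pure noise; trailing spacing ensures sampling actually starts from that timestep. In diffusers, the corresponding scheduler options are named `rescale_betas_zero_snr` and `timestep_spacing="trailing"`.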
- README.md +11 -11
- optimizer.bin +1 -1
- random_states_0.pkl +1 -1
- scheduler.bin +1 -1
- training_state-anatomy.json +0 -0
- training_state-architecture.json +0 -0
- training_state-bg20k-1024.json +2 -2
- training_state-cinemamix-1mp.json +0 -0
- training_state-ethnic.json +0 -0
- training_state-experimental.json +0 -0
- training_state-nsfw-1024.json +0 -0
- training_state-photo-aesthetics.json +0 -0
- training_state-photo-concept-bucket.json +2 -2
- training_state-shutterstock.json +0 -0
- training_state-text-1mp.json +0 -0
- training_state-yoga.json +0 -0
- training_state.json +1 -1
- unet/config.json +1 -1
- unet/diffusion_pytorch_model.safetensors +1 -1
README.md
CHANGED
@@ -44,7 +44,7 @@ You may reuse the base model text encoder for inference.
 ## Training settings
 
 - Training epochs: 0
-- Training steps:
+- Training steps: 5100
 - Learning rate: 4e-07
 - Effective batch size: 32
 - Micro-batch size: 8
@@ -59,7 +59,7 @@ You may reuse the base model text encoder for inference.
 ## Datasets
 
 ### celebrities
-- Repeats:
+- Repeats: 4
 - Total number of images: 1184
 - Total number of aspect buckets: 3
 - Resolution: 1.0 megapixels
@@ -67,7 +67,7 @@ You may reuse the base model text encoder for inference.
 - Crop style: random
 - Crop aspect: random
 ### movieposters
-- Repeats:
+- Repeats: 5
 - Total number of images: 1728
 - Total number of aspect buckets: 3
 - Resolution: 1.0 megapixels
@@ -75,7 +75,7 @@ You may reuse the base model text encoder for inference.
 - Crop style: random
 - Crop aspect: random
 ### normalnudes
-- Repeats:
+- Repeats: 5
 - Total number of images: 1056
 - Total number of aspect buckets: 3
 - Resolution: 1.0 megapixels
@@ -91,7 +91,7 @@ You may reuse the base model text encoder for inference.
 - Crop style: random
 - Crop aspect: random
 ### guys
-- Repeats:
+- Repeats: 5
 - Total number of images: 352
 - Total number of aspect buckets: 3
 - Resolution: 1.0 megapixels
@@ -107,7 +107,7 @@ You may reuse the base model text encoder for inference.
 - Crop style: random
 - Crop aspect: random
 ### signs
-- Repeats:
+- Repeats: 5
 - Total number of images: 352
 - Total number of aspect buckets: 3
 - Resolution: 1.0 megapixels
@@ -140,7 +140,7 @@ You may reuse the base model text encoder for inference.
 - Crop aspect: random
 ### experimental
 - Repeats: 0
-- Total number of images:
+- Total number of images: 3040
 - Total number of aspect buckets: 3
 - Resolution: 1.0 megapixels
 - Cropped: True
@@ -148,8 +148,8 @@ You may reuse the base model text encoder for inference.
 - Crop aspect: random
 ### ethnic
 - Repeats: 0
-- Total number of images:
-- Total number of aspect buckets:
+- Total number of images: 3072
+- Total number of aspect buckets: 3
 - Resolution: 1.0 megapixels
 - Cropped: True
 - Crop style: random
@@ -173,7 +173,7 @@ You may reuse the base model text encoder for inference.
 ### architecture
 - Repeats: 0
 - Total number of images: 4324
-- Total number of aspect buckets:
+- Total number of aspect buckets: 2
 - Resolution: 1.0 megapixels
 - Cropped: True
 - Crop style: random
@@ -220,7 +220,7 @@ You may reuse the base model text encoder for inference.
 - Crop aspect: random
 ### yoga
 - Repeats: 0
-- Total number of images:
+- Total number of images: 3584
 - Total number of aspect buckets: 3
 - Resolution: 1.0 megapixels
 - Cropped: True
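The README training settings report a micro-batch size of 8 and an effective batch size of 32, and the commit message gives 4 gradient accumulation steps. The arithmetic connecting them can be sketched as below. The helper name is hypothetical, and `num_devices=1` is an assumption inferred from 8 × 4 = 32; the commit does not state the device count. The sample total further assumes that each of the 5100 reported steps is an optimizer step consuming one full effective batch.

```python
def effective_batch_size(micro_batch_size, grad_accum_steps, num_devices=1):
    # Samples contributing to one optimizer update.
    return micro_batch_size * grad_accum_steps * num_devices

# Values from the commit message and README diff.
effective = effective_batch_size(micro_batch_size=8, grad_accum_steps=4)

# Assumption: 5100 counts optimizer steps, not micro-batches.
training_steps = 5100
samples_seen = effective * training_steps
```

Under those assumptions the run has consumed 163,200 samples, counting repeated and re-cropped images separately.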
optimizer.bin
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:f20139f218525e7cbad6a58d990f3eb502f36b1a79054be50155bb747a9ec6fa
 size 15406336826
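The binary files in this commit are stored as Git LFS pointers: three `key value` lines giving the spec version, the object hash, and the size in bytes. A minimal sketch of reading such a pointer follows. The function name `parse_lfs_pointer` and the returned dictionary layout are hypothetical; the pointer text is the new `optimizer.bin` pointer from this diff.

```python
def parse_lfs_pointer(text):
    # Each pointer line is "<key> <value>".
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid has the form "<hash-algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }

pointer_text = """\
version https://git-lfs.github.com/spec/v1
oid sha256:f20139f218525e7cbad6a58d990f3eb502f36b1a79054be50155bb747a9ec6fa
size 15406336826
"""

pointer = parse_lfs_pointer(pointer_text)
size_gib = pointer["size"] / 2**30  # bytes to GiB
```

Because only the `oid` line changed while the size stayed the same, the optimizer state was rewritten with new values but the same layout.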
random_states_0.pkl
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:144c1fb57a49c438bb6e2fc69097c45c82e024a340bce368627684caf12e4769
 size 14344
scheduler.bin
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:52ea9be04a7ddf81901a2d4dd9c15bebdb24a6ece94fe3f1432546bd67bf0bc6
 size 1000
training_state-anatomy.json
CHANGED
The diff for this file is too large to render.
training_state-architecture.json
CHANGED
The diff for this file is too large to render.
training_state-bg20k-1024.json
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
-size
+oid sha256:b91eec8b7de2e6f1ffaa95bd4db6b33bfb12066d75434c6452e62edb54073b59
+size 15758639
training_state-cinemamix-1mp.json
CHANGED
The diff for this file is too large to render.
training_state-ethnic.json
CHANGED
The diff for this file is too large to render.
training_state-experimental.json
CHANGED
The diff for this file is too large to render.
training_state-nsfw-1024.json
CHANGED
The diff for this file is too large to render.
training_state-photo-aesthetics.json
CHANGED
The diff for this file is too large to render.
training_state-photo-concept-bucket.json
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
-size
+oid sha256:f3c5d798ebd280ea0ff5c33a34e90409236fed4a94af59848ec85073b6d6f062
+size 15385652
training_state-shutterstock.json
CHANGED
The diff for this file is too large to render.
training_state-text-1mp.json
CHANGED
The diff for this file is too large to render.
training_state-yoga.json
CHANGED
The diff for this file is too large to render.
training_state.json
CHANGED
@@ -1 +1 @@
-{"global_step":
+{"global_step": 5100, "epoch_step": 166, "epoch": 1, "exhausted_backends": ["guys", "signs", "nijijourney", "propagandaposters", "bookcovers", "pixel-art", "normalnudes", "celebrities", "sports", "movieposters", "moviecollection", "gay", "ethnic", "experimental", "yoga"], "repeats": {"guys": 0, "signs": 0, "nijijourney": 0, "propagandaposters": 0, "bookcovers": 0, "pixel-art": 0, "normalnudes": 0, "celebrities": 0, "sports": 0, "movieposters": 0, "moviecollection": 0, "gay": 0, "ethnic": 0, "experimental": 0, "yoga": 0}}
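The new `training_state.json` records which dataset backends are already exhausted in the current epoch. As a rough illustration, the sketch below compares that list with the dataset names from the commit message to see which entries are not marked exhausted. The variable names are hypothetical, the `repeats` field is omitted from the embedded JSON for brevity, and treating every name in the commit message as a sampling backend is an assumption (the two `*-embeds` entries may be caches rather than image datasets).

```python
import json

# Trimmed copy of the new training_state.json ("repeats" omitted for brevity).
state = json.loads("""
{"global_step": 5100, "epoch_step": 166, "epoch": 1,
 "exhausted_backends": ["guys", "signs", "nijijourney", "propagandaposters",
   "bookcovers", "pixel-art", "normalnudes", "celebrities", "sports",
   "movieposters", "moviecollection", "gay", "ethnic", "experimental", "yoga"]}
""")

# Dataset names as listed in the commit message.
all_datasets = [
    "text-embeds-sdxl", "photocb-clip-embeds", "celebrities", "movieposters",
    "normalnudes", "propagandaposters", "guys", "pixel-art", "signs",
    "moviecollection", "bookcovers", "nijijourney", "experimental", "ethnic",
    "sports", "gay", "architecture", "shutterstock", "cinemamix-1mp",
    "nsfw-1024", "anatomy", "bg20k-1024", "yoga", "photo-aesthetics",
    "text-1mp", "photo-concept-bucket",
]

exhausted = set(state["exhausted_backends"])
# Preserve commit-message order for entries not yet marked exhausted.
not_exhausted = [name for name in all_datasets if name not in exhausted]
```

In this snapshot, 15 of the 26 listed names are exhausted; the remaining 11 include the larger sets such as `architecture`, `shutterstock`, and `photo-concept-bucket`.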
unet/config.json
CHANGED
@@ -1,7 +1,7 @@
 {
   "_class_name": "UNet2DConditionModel",
   "_diffusers_version": "0.27.2",
-  "_name_or_path": "/notebooks/datasets/models/checkpoint-
+  "_name_or_path": "/notebooks/datasets/models/checkpoint-5000",
   "act_fn": "silu",
   "addition_embed_type": "text_time",
   "addition_embed_type_num_heads": 64,
unet/diffusion_pytorch_model.safetensors
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:828dcb130273b61f3da58aecae178a60aefb5ac7a1f5e7f2e45cff2fe9a2fb3b
 size 5135151440