PseudoTerminal X
committed on
Trained for 0 epochs and 5500 steps.
Trained with datasets ['text-embeds-pixart-filter', 'photo-concept-bucket', 'moviecollection', 'experimental', 'ethnic', 'sports', 'architecture', 'shutterstock', 'cinemamix-1mp', 'nsfw-1024', 'anatomy', 'bg20k-1024', 'yoga', 'photo-aesthetics', 'text-1mp', 'movieposters', 'normalnudes', 'pixel-art', 'signs', 'midjourney-v6-520k-raw', 'sfwbooru', 'nijijourney-v6-520k-raw', 'dalle3']
Learning rate 1e-06, batch size 24, and 1 gradient accumulation steps.
Used DDPM noise scheduler for training with epsilon prediction type and rescaled_betas_zero_snr=False
Using 'linspace' timestep spacing.
Base model: ptx0/pixart-900m-1024-ft-large
VAE: madebyollin/sdxl-vae-fp16-fix
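The batch figures in this commit message and in the README diff below (micro-batch size 24, 1 gradient accumulation step, effective batch size 192) can be related with a little arithmetic. This is a minimal sketch, assuming the common convention that effective batch size = micro-batch size × gradient accumulation steps × number of data-parallel processes. The process count is not stated in the source and is only inferred here, and the function names are hypothetical.

```python
def implied_process_count(effective_batch: int, micro_batch: int, grad_accum_steps: int) -> int:
    """Infer how many data-parallel processes the reported batch sizes imply.

    Assumes effective_batch = micro_batch * grad_accum_steps * processes.
    """
    per_process = micro_batch * grad_accum_steps
    if effective_batch % per_process != 0:
        raise ValueError("effective batch is not a multiple of the per-process batch")
    return effective_batch // per_process


def samples_seen(steps: int, effective_batch: int) -> int:
    """Approximate number of samples consumed, assuming one effective batch per step."""
    return steps * effective_batch


# Values taken from the commit message and the README diff.
processes = implied_process_count(effective_batch=192, micro_batch=24, grad_accum_steps=1)
total = samples_seen(steps=5500, effective_batch=192)
print(f"implied processes: {processes}, samples seen: {total}")
```

Under these assumptions the numbers imply 8 processes and roughly 1.06 million samples over 5500 steps, which would be consistent with the run reporting 0 completed epochs across datasets of this size.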
- README.md +6 -6
- optimizer.bin +1 -1
- random_states_0.pkl +1 -1
- scheduler.bin +1 -1
- training_state-anatomy.json +0 -0
- training_state-dalle3.json +0 -0
- training_state-midjourney-v6-520k-raw.json +0 -0
- training_state-nijijourney-v6-520k-raw.json +0 -0
- training_state-photo-concept-bucket.json +2 -2
- training_state-sfwbooru.json +0 -0
- training_state-text-1mp.json +0 -0
- training_state.json +1 -1
- transformer/diffusion_pytorch_model.safetensors +1 -1
README.md
CHANGED
@@ -77,7 +77,7 @@ You may reuse the base model text encoder for inference.
 ## Training settings
 
 - Training epochs: 0
-- Training steps:
+- Training steps: 5500
 - Learning rate: 1e-06
 - Effective batch size: 192
 - Micro-batch size: 24
@@ -167,7 +167,7 @@ You may reuse the base model text encoder for inference.
 ### anatomy
 - Repeats: 5
 - Total number of images: ~21056
-- Total number of aspect buckets:
+- Total number of aspect buckets: 1
 - Resolution: 1.0 megapixels
 - Cropped: True
 - Crop style: random
@@ -239,7 +239,7 @@ You may reuse the base model text encoder for inference.
 ### midjourney-v6-520k-raw
 - Repeats: 0
 - Total number of images: ~671104
-- Total number of aspect buckets:
+- Total number of aspect buckets: 15
 - Resolution: 1.0 megapixels
 - Cropped: False
 - Crop style: None
@@ -247,7 +247,7 @@ You may reuse the base model text encoder for inference.
 ### sfwbooru
 - Repeats: 0
 - Total number of images: ~423552
-- Total number of aspect buckets:
+- Total number of aspect buckets: 35
 - Resolution: 1.0 megapixels
 - Cropped: False
 - Crop style: None
@@ -255,7 +255,7 @@ You may reuse the base model text encoder for inference.
 ### nijijourney-v6-520k-raw
 - Repeats: 0
 - Total number of images: ~670976
-- Total number of aspect buckets:
+- Total number of aspect buckets: 11
 - Resolution: 1.0 megapixels
 - Cropped: False
 - Crop style: None
@@ -263,7 +263,7 @@ You may reuse the base model text encoder for inference.
 ### dalle3
 - Repeats: 0
 - Total number of images: ~1242072
-- Total number of aspect buckets:
+- Total number of aspect buckets: 2
 - Resolution: 1.0 megapixels
 - Cropped: False
 - Crop style: None
optimizer.bin
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:1e9674252ce4ece4f58d923749b69f9c928be43818c0ca98efb6f3ec006138a7
 size 5451415117
random_states_0.pkl
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:8726f0b286825c7d70d4a5412b3defa948c273f0120759f6028918daf1308e3e
 size 16100
scheduler.bin
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:4b4f24f412eb316b43c6fca6b8eb89aa2d2d44fb4b2dee7fb2dec4caaeb101c8
 size 1000
training_state-anatomy.json
CHANGED
The diff for this file is too large to render.
training_state-dalle3.json
CHANGED
The diff for this file is too large to render.
training_state-midjourney-v6-520k-raw.json
CHANGED
The diff for this file is too large to render.
training_state-nijijourney-v6-520k-raw.json
CHANGED
The diff for this file is too large to render.
training_state-photo-concept-bucket.json
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
-size
+oid sha256:737efa0eb772e42d8c1dc7d6f75cf690bff607c33cec78f4300f868140a780b3
+size 6166382
training_state-sfwbooru.json
CHANGED
The diff for this file is too large to render.
training_state-text-1mp.json
CHANGED
The diff for this file is too large to render.
training_state.json
CHANGED
@@ -1 +1 @@
-{"global_step":
+{"global_step": 5500, "epoch_step": 1, "epoch": 1, "exhausted_backends": ["signs", "nijijourney", "pixel-art", "propagandaposters", "sports", "gay", "experimental", "yoga", "ethnic", "cinemamix-1mp", "architecture", "mj-60", "normalnudes", "bookcovers", "celebrities", "movieposters", "shutterstock", "nsfw-1024", "photo-aesthetics", "moviecollection", "bg20k-1024"], "repeats": {"bookcovers": 0, "signs": 0, "normalnudes": 0, "nijijourney": 0, "movieposters": 0, "celebrities": 0, "pixel-art": 0, "propagandaposters": 0, "sports": 0, "moviecollection": 0, "gay": 0, "experimental": 0, "yoga": 0, "ethnic": 0, "cinemamix-1mp": 0, "architecture": 0, "mj-60": 0, "text-1mp": 12, "shutterstock": 0, "nsfw-1024": 0, "photo-aesthetics": 0, "anatomy": 4, "bg20k-1024": 0}}
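The new `training_state.json` line is easier to read once parsed. The sketch below is illustrative only: it uses an abbreviated copy of the JSON from the diff (a few keys, not the full list), the helper name `summarize_state` is hypothetical, and the interpretation of the fields (a backend listed in `exhausted_backends` has been fully consumed for the current epoch; `repeats` counts extra passes per backend) is an assumption, not something the source states.

```python
import json

# Abbreviated copy of the added line; the full "repeats" and
# "exhausted_backends" entries appear in the diff above.
state_text = (
    '{"global_step": 5500, "epoch_step": 1, "epoch": 1, '
    '"exhausted_backends": ["signs", "sports"], '
    '"repeats": {"signs": 0, "sports": 0, "text-1mp": 12, "anatomy": 4}}'
)


def summarize_state(text: str):
    """Return the global step, backends with nonzero repeats, and backends not yet exhausted."""
    state = json.loads(text)
    repeated = {name: count for name, count in state["repeats"].items() if count > 0}
    not_exhausted = sorted(set(state["repeats"]) - set(state["exhausted_backends"]))
    return state["global_step"], repeated, not_exhausted


step, repeated, not_exhausted = summarize_state(state_text)
print(step, repeated, not_exhausted)
```

Run against the full line from the diff, the same logic would report `text-1mp` and `anatomy` as the only backends with nonzero repeat counts and the only ones absent from `exhausted_backends`.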
transformer/diffusion_pytorch_model.safetensors
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:21dae37b18a64da5a50bba5e84518ed7afff741cfb40d5aba7ad224c39dfe3ef
 size 1816969728
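The binary files in this commit are stored as Git LFS pointers: three `key value` lines giving the spec version, a SHA-256 object ID, and the size in bytes. As a rough sketch of how a downloaded file could be checked against such a pointer, the hypothetical helpers below parse the pointer text and compare the size and digest. They read the whole payload into memory, which is only reasonable for small files; a multi-gigabyte checkpoint would need chunked hashing instead.

```python
import hashlib


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(data: bytes, pointer: dict) -> bool:
    """Check that data has the size and digest recorded in the pointer."""
    if len(data) != pointer["size"]:
        return False
    return hashlib.new(pointer["algo"], data).hexdigest() == pointer["digest"]


# Pointer for transformer/diffusion_pytorch_model.safetensors, as shown in the diff.
pointer_text = """version https://git-lfs.github.com/spec/v1
oid sha256:21dae37b18a64da5a50bba5e84518ed7afff741cfb40d5aba7ad224c39dfe3ef
size 1816969728
"""
pointer = parse_lfs_pointer(pointer_text)
print(pointer["algo"], pointer["size"])
```

Note that several files in this commit keep the same size while the object ID changes, which is what one would expect when weights or optimizer state are overwritten in place with tensors of identical shape.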